One up, one down


Sending satellites to monitor the atmosphere is fundamental to weather and climate research. 
So why is the United States making such a meal of it? 


upper-atmosphere research satellite rained into the Pacific
Ocean, the United States is scheduled to send a new mission
into polar orbit. A joint project of NASA and the National Oceanic and 
Atmospheric Administration (NOAA), the spacecraft is to provide 
basic information for meteorologists. It will also ensure a continuing 
flow of data to climate researchers, as they have come to expect. 

There should be nothing remarkable about this. But the truth is that 
the scheduled 25 October launch of the awkwardly named NPOESS 
Preparatory Project (NPP) represents a landmark achievement for 
the United States’ long-troubled polar satellite programme. As recent 
discussions on Capitol Hill have made apparent, it remains utterly 
unclear what will follow the launch, and when. 

This odd situation has its roots in the 1994 decision by the
administration of President Bill Clinton to merge separate defence
and civilian programmes into a single National Polar-orbiting
Operational Environmental Satellite System (NPOESS).

What seemed a logical decision instead saw the programme 
spiral into a bureaucratic nightmare involving NOAA, NASA and the 
Department of Defense. In the end, it fell to President Barack Obama 
to dismantle the programme entirely and start again. More than a year 
later, NOAA and NASA are restoring order to the newly restructured 
Joint Polar Satellite System. (The same cannot be said for a separate 
component retained by the defence department, which remains 
in limbo.) 

The challenge now, of course, is money. Overall programme costs 
have more than doubled the original US$7-billion estimate, with 
NOAA alone predicting costs of nearly $12 billion to 2024. Even with 
this, expected goals have been scaled back and delayed. 

With funding flat for the programme in 2011, NOAA now wants 
$1.1 billion in fiscal year 2012, an increase of $688 million, to get the 
next polar satellite ready for launch in late 2016 or early 2017. And 
if Congress fails to provide — well, the agency threatens, Ameri- 
cans will have to suffer the consequences of a significantly degraded 
weather-forecast system. Citing storms such as Hurricane Irene, which 
hammered the US East Coast in August, NOAA assistant admin- 
istrator Kathryn Sullivan says that these satellites provide “critical 
environmental intelligence”. 

All of this leaves lawmakers on Capitol Hill rightly confused. 
Democrats and Republicans alike support the programme, yet are 
understandably wary and angry about NOAA’s ongoing failure to
provide a complete rundown of the budget going forward. House and
Senate appropriations committees did their best to find the money, but 
that doesn’t mean much in a volatile year such as this one. As Paul Broun,
the Georgia Republican who chairs the House Science Subcommittee 
on Investigations and Oversight, warned NOAA last week, Congress 
might not pass any funding bills at all, and instead might maintain 
current spending levels into 2012 and perhaps 2013. And on top of 
that, a high-profile congressional panel is busy looking for further
savings to reduce the US debt. 

NOAA deserves credit for setting the satellite programme back on 
an even keel, but the agency will need to provide more than dire warn- 
ings of disaster and human suffering if the requested funding doesn’t 
come through. Nor does it help when the agency fails to meet legiti- 
mate demands for information from lawmakers — and the press. In 
fact, on the evidence of Nature’s rebuffed enquiries, it seems harder
to get information out of NOAA than from

charge for some of the services it provides
for other agencies and institutions. This might be an effective way to 
spread some of the pain, but if the agency moves in this direction it 
should be for value-added services only. The raw satellite data them- 
selves must remain freely available, both at home and abroad. 

In the meantime, fingers are being kept firmly crossed that the 
NPP satellite makes it safely into space and performs as well as its pre- 
decessors. Beyond that, well, NASA officials say they have no reason 
to believe that the satellite's sensors are going to suddenly shut down 
at the end of their mission life, as NOAA has implied in its push for a 
successor. Indeed, many satellites run for years beyond their scheduled 
lifetime. It would be foolhardy to run a programme on this basis, but 
it may yet come to that.


Blackened names 


Canada should make public the identities of 
researchers who commit misconduct. 


When government agencies redact material from released
documents, there are inevitably questions. That was certainly
the case on 19 September, when Margaret Munro,
a reporter for Postmedia News in Canada, revealed details of several 
seemingly egregious cases of misconduct that she had obtained under 
Access to Information laws. 
The heavily censored documents, released by the Natural Sciences 
and Engineering Research Council of Canada (NSERC), the country’s 
science-funding agency, described a case in which a researcher had
faked data, then moved to another institution that may have known 
nothing of the episode. A second, even more remarkable, case involved 
a scientist who padded his CV with papers that did not exist anywhere 
in the published literature. 

But the names of the wrong-doers were blacked out, as were the 
citations of retracted papers and the details of the research fields 
affected. In some cases, information on the misconduct findings and 
the nature of the wrong-doing was also redacted. One document 
retained sentences of a letter in which someone, presumably a misconduct
perpetrator, offered abject apologies and argued that cutting
off his research funding would harm his students, but blacked out the 
name and numbers of people involved. In another, a black rectangle 
obscured a list of six problematic journal articles, some of which are 
apparently fictitious. 

Canada’s practices take privacy concerns too far. There is an 
argument for confidentiality while an investigation is going on, or 
even after it has been resolved, if allegations turn out to be frivolous 
or malicious. In the United States, many agencies withhold the names 
of those accused of misconduct. Some officials argue that cases of 
honest error should also remain confidential, to protect the careers 
of scientists exonerated of misconduct. But these arguments cease to 
be relevant once misconduct has been found, as the US government’s 
Office of Research Integrity acknowledges when it makes public the 
names of those found guilty of misconduct. 

Taxpayers have a right to know about instances in which their 
money has been misused. Research misconduct can affect the reliabil- 
ity of the scientific literature, and other scientists and journal editors 
have a clear interest in knowing about it. And the government may be
failing to protect its own interests if the names of parties disciplined
by one branch are not available to others. 

Making it clear that names will always be kept confidential — as 
NSERC does — also increases the chance of leaks from understand- 
ably frustrated whistleblowers, and of the publishing of names through 
unofficial routes. Last week, for example, the blog of Retraction Watch,
a US watchdog of the scientific literature, guessed
at the identity of one of the researchers referred

tutions, including one that Munro says had to
be ‘reminded’ twice to investigate allegations 
after a researcher had left. Yet there is a clear public interest in knowing 
whether universities — which typically employ thousands of research- 
ers and educate tens of thousands of students — have received mis- 
conduct allegations, and how they have responded. 

Redacting the conclusions of an investigation or the actions taken 
in response to wrong-doing is even worse, arguably serving to protect 
not only the institutions, but the funding agency itself. For example, 
one document provided to Munro says that the funding agency has 
decreed “that [black smudge] be declared ineligible indefinitely to hold 
or apply for a [black smudge]”. It seems absurd to redact information 
about how misconduct has been punished. NSERC says that it takes 
misconduct seriously, that cases in Canada are rare and that its actions 
send a strong message about the consequences of misconduct. But 
without any details, the message lacks force.




Heritage threats 


Research that underpins the preservation of 
cultural treasures should be championed. 


Archaeologists last week announced the discovery of what
looks likely to have been a huge shipyard dating back to the
second century AD, part of a complex of buildings at Portus,
the ancient port of Rome. The discovery (see go.nature.com/qpbirv)
is the latest in a major project to explore Portus using an array of
geophysical and electromagnetic techniques.

There are any number of similar science-based studies, projects and
initiatives under way around the world, all intended to discover or
preserve our heritage. As shown in a European web portal launched
last week at www.heritageportal.eu, the sciences involved are drawn
from across the disciplines. Projects include using climate models to
establish which heritage sites are most sensitive to global warming,
studies of dust and the effects of its transport in museums and libraries,
semantic search techniques to link digital archives and grey literature,
and atomic force microscopy to look for signs of decay in paper.

No one could deny that such science is important, given the
intrinsic interest of cultural heritage to citizens, not to mention
its often sizeable value to economies. But the field of ‘heritage
science’ is nevertheless beset by obstacles to its recognition. Because
it lacks a disciplinary focus, no traditional university department
or disciplinary funding agency readily sees it as their own. Projects
are often small scale, and unlikely to reveal the fundamental
scientific insights or spin-off products required in competitive funding
proposals.

Furthermore, even though national heritage sites and collections
may flourish, the science that helps them do so only occasionally
gets the public attention it deserves at the institution that houses
it, and rarely, if ever, at the national level. And yet without such 
science, the value of those sites and collections would be all the 
more threatened. 

To its credit, the European Commission recognizes such problems, 
and has over the past three years funded the Net-Heritage programme, 
gathering an overview of national programmes and identifying 
key weaknesses and strengths. The programme, which finishes at 
the end of this month, has opened up valuable conversations and 
coordination between Europe’s ministries of culture and other 
research funders. 

What next at the pan-European level? Although the European Com- 
mission will continue to have an operational role, the responsibility 
for coordinated research funding passes to national governments, 
through a European Joint Programming Initiative (JPI) entitled Cul- 
tural Heritage and Global Change. As a result of the Net-Heritage 
discussions and JPI programmes, there are likely to be collaborations 
between two or more countries and joint calls for research, in which 
money does not cross national borders but synergy between parallel 
national interests is acquired. 

However, there are concerns about continuity. The JPI primarily 
involves collaborations between ministries of research rather than 
culture, and within each country the culture and research ministries 
tend not to connect effectively, whether from conflicting pressures or 
turf protectionism. Accordingly, it is essential that those involved in 
the JPI, not least the Italian government spearheading it, do whatever 
they can to prevent such obstacles from undermining achievements 
in focusing European efforts. Above all, this research will depend on 
champions at government and institutional levels, as each country 
battles with budgetary pressures. Otherwise, 
the spotlight thrown on heritage science by the 
Net-Heritage programme and its portal will 
dim, and the fledgling policy coherence it brings 
will go to waste.

CORRECTION

The Editorial ‘Heritage threats’ (Nature 
477, 510; 2011) originally stated that 
Ireland was leading the European Joint 
Programming Initiative. In fact, Italy has 
overall responsibility for coordination. 
The text has been changed to reflect this. 
WORLD VIEW


Why animal research
needs to improve


Many of the studies that use animals to model human diseases are too small
and too prone to bias to be trusted, says Malcolm Macleod.


This is the golden age of medical research. Around the world,
scientists are spending more money, writing more papers and
building more shiny institutes. Almost all grant applications
suggest that a positive funding decision will support research that
could lead to new treatments for condition X, usually a growing
scourge of modern society.

Many medical discoveries have made real differences to the lives
of a great number of people, but could the research be done better?

It seems self-evident that we should encourage high-quality work,
but what makes for high quality is a matter of opinion, which hardens
over the years into dogma on the assumption that the most established
and most venerated got there for a reason, so if one wishes their good
opinion then one should do as they did.

Take experiments that use animals to model
human diseases. Empirical study of the quality
of these experiments is an emerging field, but it
does suggest that all is not well. The most reliable
animal studies are those that: use randomiza-
tion to eliminate systematic differences between
treatment groups; induce the condition under
investigation without knowledge of whether or
not the animal will get the drug of interest; and
assess the outcome in a blinded fashion. Studies
that do not report these measures are much more
likely to overstate the efficacy of interventions.

Unfortunately, at best one in three publica-
tions follows these basic protections against bias.
This suggests that authors, reviewers and editors
accord them little importance.

Other basic aspects of the design of experi-
ments in animals also receive scant attention. In
the face of pressures to reduce the number of animals used, investiga-
tors often do studies that are too small to detect a significant effect.
To guard against such ‘underpowered’ studies, researchers should
calculate the number of animals required to have a reasonable chance
of detecting the anticipated effect given the expected variance of the
data. Fewer than one in one hundred such publications report sample-
size calculations.

Fewer still define beforehand the most important (‘primary’) out-
come. As a result, they tend to report only the outcomes that happen
to show statistical significance, reducing a rigorous, hypothesis-testing
experiment to something more like observational research.

The tendency to publish only positive results is another flaw in
animal research. Such bias not only prevents scientists from getting
credit for high-quality research that happens to
be neutral, but also gives a false impression of
efficacy. My research has shown that in animal
tests of treatments for focal cerebral ischaemia
(a model for stroke), publication bias leads to an
overestimation of drug efficacy by about one-third, increasing risk
for both clinical-trial participants and the pharmaceutical industry.

NATURE.COM
Discuss this article online at: go.nature.com/zdgwat

Experimental approaches are not very different throughout the life 
sciences, so the biases are probably similar too. A scientist's environ- 
ment is full of potential hazards, such as non-renewal of funding, and 
potential rewards — getting published and receiving grants. As long as 
cheap, underpowered studies are more likely to have exciting positive 
(if false) results than expensive, well conducted, large studies — and 
as long as journals don’t seem to know the difference — the pressure 
will remain to do what everyone else does. 
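The sample-size calculation recommended earlier, as a guard against underpowered studies, can be illustrated with a short sketch. It is not taken from the article: it assumes a simple two-group comparison of means, a two-sided test and the standard normal approximation, and the function name and default values (5% significance, 80% power) are illustrative choices only.

```python
import math
from statistics import NormalDist


def animals_per_group(effect, sd, alpha=0.05, power=0.8):
    """Approximate animals needed per group to detect a difference in means.

    effect: anticipated difference between group means (assumed by the user)
    sd:     expected standard deviation of the outcome within each group
    alpha:  two-sided significance level
    power:  desired probability of detecting the effect if it is real
    """
    if effect == 0 or sd <= 0:
        raise ValueError("effect must be non-zero and sd must be positive")
    # Quantiles of the standard normal distribution
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_power = NormalDist().inv_cdf(power)
    # Normal-approximation formula: n = 2 * ((z_alpha + z_power) * sd / effect)^2
    n = 2 * ((z_alpha + z_power) * sd / effect) ** 2
    return math.ceil(n)


if __name__ == "__main__":
    for effect in (1.0, 0.5):
        print(effect, animals_per_group(effect, sd=1.0))
```

Under these assumptions, an anticipated effect of one standard deviation calls for about 16 animals per group, and halving the anticipated effect roughly quadruples that number to about 63. The normal approximation slightly understates what an exact t-test calculation would require, so the result is best read as a lower bound.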

So we need to change the rules. If publication in high-impact jour- 
nals continues to be a yardstick, then the review process must do 
much more to assess bias. The ARRIVE (Ani- 
mal Research: Reporting In Vivo Experiments) 
guidelines, endorsed by, among others, Nature
Publishing Group, are a good start. But, as Don 
Quixote observed, the proof of the pudding will 
be in the eating. 

There must also be better ways to publish 
neutral studies. If the focal cerebral ischaemia 
literature reflects the life sciences generally, 
then 16% of studies go unpublished, and tack- 
ling publication bias would increase the number 
of manuscripts published every year by 160,000. 
At current growth rates we would expect this 
increase anyway over the next four years, so 
sorting out publication bias should be possible. 
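The arithmetic behind these figures can be reconstructed as a rough sketch. It assumes that the 16% is a share of all studies carried out and that the 160,000 extra manuscripts correspond exactly to that share; the implied totals and the annual growth rate are derived here for illustration and are not stated in the article. The function names are hypothetical.

```python
def implied_totals(unpublished_share, extra_manuscripts):
    """Return (all studies per year, currently published per year).

    Assumes extra_manuscripts is exactly unpublished_share of all studies.
    """
    total = extra_manuscripts / unpublished_share
    published = total - extra_manuscripts
    return total, published


def annual_growth_to_absorb(published, extra, years):
    """Compound annual growth rate that adds `extra` manuscripts in `years`."""
    return ((published + extra) / published) ** (1 / years) - 1


if __name__ == "__main__":
    total, published = implied_totals(0.16, 160_000)
    growth = annual_growth_to_absorb(published, 160_000, years=4)
    print(f"implied studies per year:    {total:,.0f}")
    print(f"implied published per year:  {published:,.0f}")
    print(f"annual growth over 4 years:  {growth:.1%}")
```

On these assumptions the figures imply roughly one million studies a year, of which about 840,000 are published, and absorbing the extra manuscripts over four years corresponds to growth of about 4.5% a year.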

At the very least, we should look for ways to 
register all experiments — so that investigators 
can receive credit for work done and so that 
those seeking to summarize what is known 
have access to all relevant data. Such a system could be flexible, with 
information embargoed for a time to protect intellectual property. 

It is hugely distressing to hear highly motivated young scientists 
say that they would prefer to do their research ‘properly’, but that if 
they don’t get more published from their PhD work they will never 
find a postdoc position. They feel forced to lower their standards. 
We owe it to them to create an environment in which the rewards for 
conducting high-quality research are more immediately apparent.


Malcolm Macleod is a clinical neuroscientist at the University of 
Edinburgh, UK, and a member of CAMARADES (Collaborative 
Approach to Meta Analysis and Review of Animal Data from 
Experimental Studies). e-mail: malcolm.macleod@ed.ac.uk 
RESEARCH
Faster than light? 


Physicists were this week 
rushing to pore over the details 
of an Italian experiment 
suggesting that neutrinos
can travel faster than light.
The experiment, OPERA,
is a neutrino detector under
the Gran Sasso National
Laboratory near L’Aquila.

It saw a beam of neutrinos 
travelling 730 kilometres 
from CERN, Europe's 
particle-physics laboratory in 
Geneva, Switzerland, arrive 
60 nanoseconds earlier than 
light would if travelling in 
a vacuum. The results were
unveiled on 22 September. See 
page 520 for more. 
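As a rough check on the scale of the claimed effect, the fractional speed excess implied by the two figures quoted above can be worked out directly. This is a back-of-envelope sketch, not the experiment's analysis: it treats the 730 kilometres and 60 nanoseconds as exact, although both are rounded, and the function name is illustrative.

```python
C = 299_792_458.0  # speed of light in a vacuum, metres per second (exact by definition)


def fractional_speed_excess(distance_m, early_s):
    """Return (v - c) / c for a particle arriving `early_s` seconds before light."""
    light_time = distance_m / C            # time light would take over the baseline
    particle_time = light_time - early_s   # measured time of flight
    speed = distance_m / particle_time
    return (speed - C) / C


if __name__ == "__main__":
    excess = fractional_speed_excess(730_000, 60e-9)
    print(f"light travel time: {730_000 / C * 1e3:.3f} ms")
    print(f"fractional excess: {excess:.2e}")
```

With these rounded inputs, light would need about 2.4 milliseconds to cover the distance, so an arrival 60 nanoseconds early corresponds to a speed excess of roughly 2.5 parts in 100,000.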


RIP XMRV 


Researchers have partially
retracted a controversial
study reporting the discovery
of a virus called XMRV in
blood from patients with
chronic fatigue syndrome
(V. C. Lombardi et al. Science
326, 585-589; 2009). The
findings came under attack
when other laboratories were
unable to reproduce them (see
Nature 471, 282-285; 2011).
On 22 September, Science
published the retraction
together with the findings
of a working group that had
distributed patient samples to
nine independent laboratories.
Only two could find XMRV or
related viruses, and even those
labs did not reproduce their
findings when given replicate
samples. See go.nature.com/olxles
for more.


Nuclear reaction 


The International Atomic
Energy Agency (IAEA) will
be obliged to speak out in
times of crisis, under an action
plan approved by member
states on 22 September. The
agency was widely criticized
for failing to give clear and
independent assessments of
the nuclear accident at the
Fukushima Daiichi power
plant in Japan in March. The
new plan calls on the IAEA to
provide “timely, clear, factually
correct, objective and easily
understandable information
during a nuclear emergency”.
The agency will also consider
reforming its seven-point
nuclear-emergency scale. See
go.nature.com/c7ecgr for
more.


Measuring the ocean’s saltiness


This map showing variations in saltiness in the
world’s oceans is the first produced by NASA’s
Aquarius satellite, which launched in June. The
probe is a collaboration with Argentina’s space
agency, CONAE, and picks up weak microwave
radiation emitted naturally by the ocean.
The radiation varies according to the electrical
conductivity of the water, which in turn is tied
to salinity. Because salinity is linked to water
density and evaporation, the data could help to
confirm theories about the global water cycle and
its response to climate change. Most of the features
apparent in the map are well known, such as the
greater salinity of subtropical regions that experience
more evaporation from sunlight. The results,
released on 22 September, will be combined with
data from the European Space Agency’s Soil Moisture
and Ocean Salinity (SMOS) mission.


Disease summit 


Without committing 
themselves to specific targets, 
world leaders pledged action 
on non-communicable disease 
at a United Nations summit
in New York last week. The
meeting emphasized the toll
of conditions such as heart
attack, cancer and diabetes
(see Nature 477, 260-261; 
2011), and discussed actions 
such as controls on salt and 
tobacco consumption. It will 
be up to individual nations to 
put such controls into place. 


UK science cuts 

The Royal Society in London 
has written to Britain’s 
physical sciences funding 
agency, telling it to “pause” 
controversial changes to its 
funding strategy. The letter, 
released on 22 September and 
co-signed by other UK learned 
societies, came after months 
of discontent from scientists 
at how the Engineering and 
Physical Sciences Research 
Council was passing on 
government budget cuts. 
Researchers complain that
the agency has not consulted
widely enough, and that it has
not used rigorous evidence
to decide on measures such
as reducing support for
synthetic organic chemistry.
The agency's council meets
in October to consider the
criticisms.


Working balance 
The US National Science
Foundation, one of the
country’s main sources
of research funding, has
announced a plan to
encourage more flexible 
workplace policies. The 
changes include allowing 
scientists to delay or suspend 
grants for up to a year to care 
for new children or fulfil other 
family obligations, the agency 
says. It asked universities and 
research institutes to adopt 
similar policies. 


SOURCE: NASA/GSFC/JPL-CALTECH 
Murder plea

Amy Bishop, the biologist 
who shot and killed three 
colleagues, and injured three 
others, at the University
of Alabama in Huntsville
last year (see Nature 465,
150-155; 2010) entered a
plea of not guilty “by reason
of mental disease or defect”,
in an Alabama courtroom 
on 22 September (pictured). 
Bishop’s trial will begin on 
19 March 2012. 


Federal fraud 

The head of a US
biotechnology company
that received federal funds
to research vaccines has
been indicted for fraud.
Jian-Yun Dong, president
of GenPhar in Mount
Pleasant, South Carolina,
and an immunologist at the
Medical University of South
Carolina in Charleston,
is charged (along with an
unnamed vice-president)
with making “false, fictitious
and fraudulent” statements
in 2004-11 in order to get
research grants from the
National Institutes of Health,
among other agencies. The
indictment, filed at the US
district court in Charleston
in April and publicly released
on 19 September, says
that Dong obtained at least
US$3.6 million as a result of
his offenses.


TREND WATCH


Global greenhouse-gas
emissions increased by 5.8% in
2010 to hit an all-time high of
33 billion tonnes, as continued
growth in developing countries
swamped both greater use of
renewable power and gains in
energy efficiency, according
to an analysis by the European
Commission's Joint Research
Centre and the PBL Netherlands
Environmental Assessment


Green leader dies 


Inspirational environmental
campaigner Wangari
Maathai died in Nairobi on
25 September, at the age of
71. Maathai was the founder
of the African tree-planting
initiative the Green Belt
Movement, which aims
to empower women and
improve the quality of their
lives by providing better
access to clean water and
firewood while conserving
the environment. A veterinary
scientist, Maathai won the
Nobel Peace Prize in 2004
for her work in supporting
democracy, human rights and
the environment.


Sequencing sags 
DNA-sequencing firm 
Pacific Biosciences is laying 
off 130 employees, or 28% 
of its staff, the company said 
on 20 September. Based in
Menlo Park, California, the
firm started to ship its first
commercial machines for 
real-time, single-molecule 
DNA sequencing in April. But 
demand has not climbed as 
rapidly as hoped. Some other 
sequencing companies, such as 
Life Technologies in Carlsbad, 
California, also announced 
lay-offs this year, as the 
sequencing market reaches 
saturation ahead of what 
analysts expect to be a few 
difficult years for the budgets 
of research funders such as 
the US National Institutes of 
Health. 


Shadows on solar 


Republican politicians have
ramped up criticism of the US
government's clean-energy
loan-guarantee programme,
which has come under fire
since the bankruptcy of solar-
energy company Solyndra,
based in Fremont, California,
early this month. A House of
Representatives subcommittee
is asking whether the
Department of Energy
was too hasty in granting
the firm a US$535-million
loan guarantee in March
2009. On 23 September,
two Solyndra executives
refused to give testimonies at
the subcommittee hearing.
Meanwhile, US solar firms
First Solar, based in Tempe,
Arizona, and SolarCity,
of San Mateo, California,
have blamed red tape for
their failures to secure
loan guarantees before the
end of this fiscal year on
30 September.


LONG-TERM TRENDS IN CO₂ EMISSIONS

Growth in developing nations is swamping any cuts in greenhouse-gas
emissions by industrialized countries.


30 SEPT-2 OCT

The fourth annual
meeting on personal
genomes, at Cold Spring
Harbor, New York,
looks at the issue of
incorporating genome
data into routine clinical
practice.
go.nature.com/hozwho


3-5 OCTOBER

The winners of the
2011 Nobel prizes for
physiology or medicine,
physics and chemistry
are announced in
Stockholm.
www.nobelprize.org


3-5 OCTOBER

The world stem-cell
summit in Pasadena,
California, focuses on
progress in translating
stem-cell research for
the clinic.
go.nature.com/kov6xu


Stem-cell trials 


The biotechnology firm
Advanced Cell Technology
has been given the go-ahead
to conduct a trial that makes
use of human embryonic
stem (ES) cells in the United
Kingdom, the first such
approval outside the United
States. The company, which
is headquartered in Santa
Monica, California, is already
conducting US trials; the UK
trial will be a repeat version,
using retinal cells derived from
human ES cells to treat people
with Stargardt’s disease, an
inherited form of progressive
blindness. The company says
it is in late-stage talks with
regulators and clinicians in
France, China and elsewhere to
launch further global trials. See
go.nature.com/ogdjgt for more.


Escalating the fight against childhood diarrhoea p.519
Improbable neutrino result prompts hunt for cause p.520
Aboriginal genome addresses consent issues p.522
Four fixes for costly, failure-prone clinical trials p.526
CLIMATE CHANGE


Indian carbon-credit claims — such as those for the Baglihar Dam (above) — are under scrutiny. 


Clean-energy 
credits tarnished 


WikiLeaks reveals that most Indian claims are ineligible. 


BY QUIRIN SCHIERMEIER 


As the world gears up for the next round
of United Nations climate-change
negotiations in Durban, South Africa,
in November, evidence has emerged that a
cornerstone of the existing global climate 
agreement, the international greenhouse-gas 
emissions-trading system, is seriously flawed. 
Critics have long questioned the usefulness 
of the Clean Development Mechanism (CDM), 
which was established under the Kyoto Proto- 
col. It allows rich countries to offset some of 
their carbon emissions by investing in climate- 
friendly projects, such as hydroelectric power 
and wind farms, in developing countries. Verified
projects earn certified emission reductions
(CERs), carbon credits that can be bought
and sold, and count towards meeting rich 
nations’ carbon-reduction targets. 

But a diplomatic cable published last month 
by the WikiLeaks website reveals that most of 
the CDM projects in India should not have 
been certified because they did not reduce 
emissions beyond those that would have been 
achieved without foreign investment. Indian 
officials have apparently known about the 
problem for at least two years. 

“What has leaked just confirms our view 
that in its present form the CDM is basically a 
farce,” says Eva Filzmoser, programme director
of CDM Watch, a Brussels-based watchdog
organization. The revelations imply that
millions of tonnes of claimed reductions in
greenhouse-gas emissions are mere phantoms,
she says, and potentially cast doubt over the 
principle of carbon trading. “In the face of 
these comments it is no wonder that the United 
States has backed away from emission trading,” 
Filzmoser says. 

The cable, written on 16 July 2008, was sent 
by the US consulate in Mumbai, India, to the 
US secretary of state, and summarizes a discus- 
sion of the CDM involving representatives of 
the consulate and the US Government Account- 
ability Office, along with Indian officials and 
executives of large Indian companies. At the 
time, 346 Indian projects had been registered 
with the CDM’s executive board. Today, more 
than 720 Indian projects have been approved 
and have gained some 120 million tonnes’ worth 
of carbon credits, a large fraction of the 750 mil- 
lion tonnes issued since 2005 (see ‘Cleaning up’). 

Yet on the evidence of discussions at the 
meeting, most of the carbon-offset projects in 
India fail to meet the CDM requirements set by 
the UN Framework Convention on Climate 
Change. The cable also describes the UN’s vali- 
dation and registration process as “arbitrary”. 

Indian authorities were also criticized in 
the cable. All CDM projects must be validated 
nationally, then verified independently by an 
accredited firm. But the cable quotes R. K. Sethi, 
then chairman of the CDM’s executive board 
and member-secretary of the Indian CDM 
authority in New Delhi, as admitting that the 
authority simply “takes the project developer at 
his word for clearing the additionality barrier”. 

“This will not invalidate carbon trading, but 
it does go to show that the CDM has serious 
flaws,” says Mark Maslin, a climatologist at University
College London. “In India and China,
the multiple levels of governance which you 
need to have in place to make carbon trading 
work are simply not there.” 

Martin Hession, head of global carbon 
markets at the UK Department of Energy and 
Climate Change, and chairman of the CDM 
executive board, says that the critical remarks 
in the cable date from a time when “people 
were complaining a lot” about problems with 
the CDM. Controversies over whether specific 
projects reduce net emissions are still common, 
he says. But since 2008, the board has followed 
more stringent guidelines for verifying the eli- 
gibility of projects and for enhancing the overall 
efficiency of the scheme. 

“The CDM is much more transparent and 
predictable than the tenor of these remarks 
might suggest,” he says. “We reject many
projects in India and China because they fail 
to meet the required criteria, and we do in fact 
often get the message that project validation has 
become too stringent.” 

Others argue that the rules are still not rigor- 
ous enough. In some circumstances, the CDM 
may actually have encouraged the production 
in developing countries of the coolant chemi- 
cal HFC-23, an extremely potent greenhouse 
gas (M. Wara Nature 445, 595-596; 2007). 
Critics have demanded harsher sanctions 
against validating companies found guilty of 
lax oversight, together with clearer conflict-of- 
interest policies and tighter rules on what quali- 
fies as an additional clean-development project. 

International Rivers, an environmental 
campaigning group based in Berkeley, Califor- 
nia, is now calling on the CDM executive board 
to reject the 412-megawatt Rampur hydro- 
power project in Himachal Pradesh, India, 
which is awaiting CDM approval. The project 
could earn some 15 million carbon credits 
from 2012 to 2022, amounting to an estimated 
US$150-million windfall for the Shimla-based 
developer Satluj Jal Vidyut Nigam Limited, the
group says. But the decision to finance that
project was taken long before the CDM was
even created, says Himanshu Thakkar, director
of the Delhi-based South Asia Network on
Dams, Rivers and People, clearly invalidating
its application.

Roughly one-fifth of projects in the Clean
Development Mechanism are registered in India,
but most do not comply with the scheme.

The company stands by its claim that the 
project qualifies for the CDM, and says that the 
Indian government approved the investment 
proposal for the project in 2007, when the 
CDM was already in place. As Nature went to 
press, the CDM’s executive board, which met in 
Quito, Ecuador, this week, had not yet decided 
whether to approve the Rampur project. 

Despite the controversy, the European 
Union seems determined to continue its 
mandatory emissions-trading system, which 
it sees as crucial in tackling climate change. 
There's little doubt about the urgency of that 
goal: global carbon dioxide emissions have 
increased by 45% since 1990, reaching an 
all-time high of 33 billion tonnes in 2010, 
according to a report released last week by the 
European Commission (see page 515).


Europe to map the 
human epigenome 


DNA-modification studies get a multi-million euro boost. 


BY ALISON ABBOTT 


The health-research division of the
European Commission launches its
largest-ever project next week with a
€30-million (US$41-million) investment in
understanding the human epigenome, the
constellation of DNA modifications that shape
how genes are expressed.

With the project, called BLUEPRINT, 
Europe intends to become a major player in 
the International Human Epigenome Consor- 
tium (IHEC), set up last year to help biologists 
understand how the epigenome influences 
health and disease. 

All the cells that make up an individual orig- 
inate from the same fertilized egg and share the 
same genome. But during development, cells 
acquire epigenetic changes — such as chemi- 
cal modifications to DNA, and changes in the 
shape of its tightly coiled three-dimensional 
structure — that affect which genes will be 
active in which cells at a given time and which 
will be silent. 

The epigenome then remains largely stable,
so that a liver, for example, remains a
liver throughout life. But small parts of the
epigenome change constantly as the cell 
responds to environmental changes. The epige- 
nome also changes in diseases, including cancer. 

The importance of the epigenome in health 
and disease is becoming increasingly clear. 
But researchers studying it have faced a big 
hurdle: the lack of a reliable library of high- 
quality, quantitative reference epigenomes 
against which new data can be compared. Lit- 
tle is known, even, about how much the epig- 
enome normally varies between individuals, 
or between the different cells in an individual. 

Enter BLUEPRINT, which unites 41 institu- 
tions and more than 50 principal investigators 
across Europe who will contribute a further 
€10 million to the project. BLUEPRINT will 
provide at least 100 reference epigenomes 
toward the IHEC’s goal of amassing 1,000 ref- 
erence epigenomes by 2020. 

BLUEPRINT has chosen to focus on the 
blood system. This should help move discov- 
eries quickly into the clinic, as many diagnostic 
tests rely on blood samples. “Blood is also what 
gets stored in biobanks and used for genome 
analysis,” says BLUEPRINT coordinator
Henk Stunnenberg at the Nijmegen Center
for Molecular Life Sciences in the Netherlands. 
Beyond these conveniences, there was also a 
compelling biological rationale for the choice, 
he says. Unlike most tissues, blood cells are 
constantly renewed, so blood comprises a mix 
of cells at different stages of maturity. “Blood- 
cell epigenomes may reveal some general rules 
about how cells develop,” Stunnenberg says.

BLUEPRINT will generate reference epi- 
genomes from 60 different cell types, taken 
from the blood of healthy individuals stored 
in the UK’s national blood bank. Each epig- 
enome will include a full genome sequence, 
and genome-wide quantitative data on the 
occurrence and distribution of nine different 
epigenetic markers. 

For comparison with healthy epigenomes, 
the consortium will produce reference epig- 
enomes for more than 60 blood-cancer cell 
types. It will carry out experiments in mice to 
work out how much of the epigenome is her- 
itable. The consortium also plans to generate 
lower-resolution epigenomes from two blood- 
cell types from 100 healthy people to provide 
a first quantitative indication of natural indi- 
vidual variation. 

“BLUEPRINT is the first big epigenome 
project to be specifically created in alignment 
with the IHEC mission,” says Peter Jones at 
the University of Southern California in Los 
Angeles, who helped to launch the IHEC. 
“Blood epigenomes are particularly exciting 
because we know an awful lot about the biol- 
ogy of how blood stem cells differentiate, but 
little about the sequence of epigenomic events 
involved in the processes that are going to be 
relevant for disease.”
Vaccine campaign to target
deadly childhood diarrhoea 


Programme to guard against second-biggest killer of under-fives rolls out across Africa.


BY DECLAN BUTLER 


Every year, more than one
million children under the
age of five die as a result of
diarrhoea. It is the second-big- 
gest killer in this age group, after 
pneumonia, and 40% of diarrhoea 
deaths are caused by rotavirus. 
That horrific toll could soon 
fall, thanks to the first major 
roll-out of rotavirus vaccines in 
Africa, where half of rotavirus 
deaths occur. The programme 
was unveiled this week by the 
GAVI Alliance — formerly the 
Global Alliance for Vaccines 
and Immunisation — based in 
Geneva, Switzerland. The group 
hopes to immunize 50 million 
children against rotavirus in 40 of 
the world’s poorest countries by 
2015. Although rotavirus gastro- 
enteritis is common worldwide, 
it kills very few children in richer 
countries, where hospitalization 
and intravenous rehydration 
are readily available, and where 
vaccination is becoming more common.

ROTAVIRUS ROLL-OUT
The vaccine will be given to children in 13 African countries, where
rotavirus is responsible for high child mortality rates.
Vaccine roll-out in Armenia, Georgia, Moldova and Yemen not shown.

GAVI has already funded successful
rotavirus-vaccination programmes in
Nicaragua, Bolivia, Guyana and Honduras, but
the effort is now ramping up to cover 13 countries
in Africa and a further 4 elsewhere (see
‘Rotavirus roll-out’). “It’s an incredibly exciting
time, to see this vaccine finally reaching the
children in Africa,” says John Wecker, head of
vaccine access and delivery at the Program for 
Appropriate Technology in Health (PATH), 
an international organization based in Seattle, 
Washington, that runs the Rotavirus Vaccine 
Program (RVP) in partnership with the World 
Health Organization (WHO) and the US 
Centers for Disease Control and Prevention. 
In the past it has often taken 15-20 years for
a vaccine approved in rich countries to reach 
poorer ones, but GAVI funding has sharply 
reduced that lag for 
rotavirus and some 
other vaccines. The cur- 
rent rotavirus vaccines 
were first approved by 
the WHO and the US
Food and Drug Administration just five years
ago, on the basis of clinical trials in rich and 
middle-income countries. The RVP’s trials of 
the vaccines in poorer countries were crucial 
for a wider roll-out, even though they showed 
that the vaccines are not quite as effective in 
those areas, where children are often malnour- 
ished and have other infectious diseases. The 
trials have proved, however, that the vaccines 
do reduce case numbers and, more impor- 
tantly, substantially prevent serious disease 
and deaths. Unexpectedly, the vaccines also 
protect against multiple, fast-mutating strains 
of rotavirus, alleviating concerns among 
scientists that many different rotavirus vac- 
cines might have been needed, with frequent 
reformulations. 

Over the past few years, the RVP has sought
to convince health officials in developing
countries to push for vaccination, using evidence
of the disease burden and the vaccines’
effectiveness in their countries. It has helped
to build disease-surveillance networks that
will aid in identifying the places
of greatest need, assessing the
effectiveness of the vaccines and
monitoring their safety.

Rotavirus-vaccine development 
suffered a setback in the 1990s, 
when RotaShield, made by Wyeth, 
had troublesome side effects. It 
was introduced in the United 
States in 1998, but was pulled 
from the market a few months 
later, after evidence arose that it 
elevated the risk of intussusception,
an intestinal blockage that
can prove fatal, by 1-2 cases for
every 10,000 vaccinated children. 

Clinical trials of the newer 
Rotarix and RotaTeq vaccines, 
each comprising more than 60,000 
children, found no elevated risk of 
intussusception. Later surveillance 
of the roll-out in Latin America, 
however, did show a risk, albeit far 
lower than that caused by Rota- 
Shield, as was reported in June (M. 
M. Patel et al. N. Engl. J. Med. 364, 
2283-2292; 2011). The WHO and 
national regulators who reviewed 
the data say that the benefits far 
outweigh the slightly elevated 
intussusception risk of 1-2 in 
100,000 vaccinated children, says Wecker. 

GlaxoSmithKline, which makes Rotarix, 
has offered to make its vaccine available for 
the GAVI campaign at US$2.50 per dose 
(vaccination requires two doses). That is 67% 
less than the current lowest price for the vac- 
cine, although negotiations are still ongoing. 
Merck has promised to discount RotaTeq. 
GAVI expects prices to fall further as several 
manufacturers in emerging economies bring 
their own vaccines into production over the 
next few years. 

The rotavirus effort follows the December 
2010 launch of GAVI’s vaccine programme targeting
pneumococcus, a bacterium that causes
pneumonia and other infections. This week, 
the alliance expanded that programme from 
19 to 37 developing countries. Together, the 
campaigns are “definitely a game changer, no 
question”, says Wecker. “The vaccines will significantly
reduce the burden of diarrhoea and
pneumonia, the two leading killers of under-fives
in developing countries.”
RACING LIGHT


By comparing the proton signal at CERN to the resulting neutrino signal at Gran Sasso, the OPERA 
experiment was able to calculate the neutrinos’ time of flight as they passed through Earth. 
Speedy neutrinos
challenge physicists 


Experiment under scrutiny as teams prepare to test claim 
that particles can beat light speed. 


BY EUGENIE SAMUEL REICH 


The joke begins with the barman saying:
“I’m sorry, we don’t serve neutrinos.”
Then the punch line: a neutrino walks 
into a bar. 

Such causality-bending humour has been 
rife on the Internet in the past week, following 
the news that an experiment at the Gran Sasso 
National Laboratory near L’Aquila, Italy, has
apparently clocked neutrinos exceeding the
speed of light as they travelled 730 kilometres
from their source at CERN, Europe’s particle-physics
laboratory near Geneva, Switzerland.

The finding by the OPERA (Oscillation 
Project with Emulsion-tracking Apparatus) 
collaboration, released on 22 September, 
has the media abuzz with talk of a century’s
worth of physics upended, starting with Albert
Einstein's special theory of relativity. This sets 
the velocity of light as the inviolable and unat- 
tainable limit for matter in motion, and links 
it to deeper aspects of reality, such as causality. 

Physicists, for the most part, suspect that 
an unknown systematic error lies behind 
OPERA’s startling result. But nothing obvious
has emerged, and many see the experiment as
a tour de force because of its high precision.
“It is quite a complicated experiment but they
did a professional job,” says Rob Plunkett, 
co-spokesman for the MINOS (Main Injector 
Neutrino Oscillation Search) experiment at 
Fermilab in Batavia, Illinois, which is likely to 
investigate the claim. 

OPERA was switched on in 2006 to study the 
peculiar ability of the fleeting, nearly massless 
neutrinos to ‘oscillate’ from one type to others
as they travel. It was later adapted to measure 
neutrino velocity to a higher precision than 
any previous experiment. A crucial aspect of 
this was the pioneering use of the Global Positioning
System (GPS) to establish a time reference
at both ends of the neutrinos’ flight path.
Another was the careful timing of particles 
through different stages of the experimental 
apparatus, using a pair of ultra-high-accuracy 
caesium clocks to determine sources of delay. 

In March 2011, the group was shocked to 
discover that its data suggested neutrinos 
were arriving 60 nanoseconds faster than light 
would do over the equivalent distance. “After 
six months of cross-checking we decided to 
go public,” says Dario Autiero of the Institute 
of Nuclear Physics in Lyons (IPNL), France, 
who is the physics coordinator for OPERA. 
Most members of the collaboration agreed 
with the decision, but physicist Caren Hagner 
of the German Electron Synchrotron (DESY) 
in Hamburg was among those who declined 
to put their name to the result. In her case, she 
says, it was not because of any specific prob- 
lem, but because she would have liked to spend 
more time checking the finding. 
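
To put the 60-nanosecond figure in perspective, the arithmetic can be sketched in a few lines of Python. This is an illustration only, not the collaboration's analysis: it uses the rounded 730-kilometre baseline and the 60-nanosecond early arrival quoted above, and the function and variable names are invented for the example.

```python
# Speed of light in vacuum, in metres per second (exact by definition).
C = 299_792_458.0


def light_travel_time(distance_m: float) -> float:
    """Time in seconds for light to cover distance_m in vacuum."""
    return distance_m / C


def fractional_speed_excess(distance_m: float, early_s: float) -> float:
    """Return (v - c) / c for a particle arriving early_s seconds before light."""
    t_light = light_travel_time(distance_m)
    t_particle = t_light - early_s
    # Same distance, shorter time: v / c = t_light / t_particle.
    return t_light / t_particle - 1.0


# Rounded values taken from the article text (assumptions for this sketch).
BASELINE_M = 730e3        # CERN to Gran Sasso, about 730 km
EARLY_ARRIVAL_S = 60e-9   # neutrinos about 60 ns ahead of light

t_light = light_travel_time(BASELINE_M)
excess = fractional_speed_excess(BASELINE_M, EARLY_ARRIVAL_S)

print(f"Light travel time: {t_light * 1e3:.3f} ms")
print(f"Fractional speed excess: {excess:.2e}")
```

With these rounded inputs, light needs about 2.4 milliseconds for the trip, so a 60-nanosecond lead corresponds to a speed excess of roughly 2.5 parts in 100,000.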

The release of the team’s data on the arXiv 
server (The OPERA Collaboration, preprint 
at http://arxiv.org/abs/1109.4897; 2011) and 
in a presentation (see go.nature.com/kl4jah) 
now have the neutrino community looking for 
sources of error that might explain the result. 
Two elements of the experiment receiving par- 
ticular scrutiny include the GPS-synchroniza- 
tion system and the profile of the proton beam 
that generates the neutrinos as a by-product of 
colliding with a target. Experimenters deter- 
mined the flight time by comparing the shape of 
the proton signal at CERN to that of the neutrino 
signal received at Gran Sasso (see ‘Racing light’). 

Two other collaborations are looking to 
check the OPERA result independently. 
Neither can yet time neutrinos to the same 
level of precision; however, an upgrade is 
already planned for MINOS, which sends a 
beam of muon neutrinos from Fermilab to 
the Soudan mine in Minnesota — roughly the 
same distance as from CERN to Gran Sasso. 

Japan's T2K experiment, which sends neu- 
trinos 295 kilometres from an accelerator in 
Tokai to the Super-Kamiokande detector in 
Kamioka, was shut down after the earthquake 
on 11 March. Scientists there are also discussing
an upgrade to check OPERA’s result, says
co-spokesman Chang Kee Jung. 

The upgrades would take more than a year to 
implement. In the meantime, researchers with 
both the T2K and MINOS collaborations are 
taking a second look at their existing data to 
see whether they are consistent with OPERA’s
result. Plunkett says that the MINOS group 
might have an answer within a few months. 
And even if the speed of light remains unbro- 
ken, a move to more accurate timing will 
bolster experiments in the long run.
‘Patent trolls’ target
biotechnology firms 


Opportunistic patent litigation threatens the life sciences. 


BY ERIKA CHECK HAYDEN 


The biotechnology industry has had its
share of woes, but so far ‘patent trolls’
have not numbered among them. These
companies, which profit by legally enforcing 
patents they own rather than developing prod- 
ucts, may benefit from a 31 August ruling at a 
US federal court of appeal in Washington DC. 

The court upheld a lawsuit filed by Classen 
Immunotherapies of Baltimore, Maryland, 
against four biotechnology companies and a 
medical group, for infringing on a patent that 
covered the idea of trying to link infant vacci- 
nation with later immune disorders. A district 
court had thrown out the lawsuit, finding that 
the concept at the heart of the case amounted 
to an abstract idea that could not be patented. 
The appeals court found otherwise. 

Beyond its complex particulars, the case 
sets “a troubling precedent”, says James Bessen,
a lawyer at the Boston University School
of Law, Massachusetts, “because you're patent- 
ing something that’s very broad”. (The patents 
include the act of reading the published scien- 
tific literature and using it to create vaccination 
schedules that minimize immune disorders.) 

Joseph Zito, the plaintiff’s lawyer, says that 
his client, John Barthelow Classen, was the first 
researcher to connect vaccination schedules 
to immune disorders through animal studies. 
“He doesn’t want to stop anybody,” Zito says.
“He wants to make sure people use vaccines 
safely.” But very broad patents have posed a 
problem in the technology field, where some 
firms amass vast portfolios of patents bought 
up from inventors and look for targets to sue. 


HARMING SOCIETY 

In a study released on 19 September, Bessen
and his co-authors estimate that patent lawsuits 
filed by trolls, also known as ‘non-practicing 
entities’ (NPEs), caused defendants to lose 
more than US$500 billion in value from 1990 
to 2010 (ref. 1). The figure was calculated by 
analysing the effect on defendants’ stock prices 
when they were sued. 

“We conclude that the loss of billions of dol- 
lars of wealth associated with these lawsuits 
harms society,” the authors write. 

In 2009, Colleen Chien, a lawyer and
patent expert at Santa Clara University in
California, calculated that the proportion of
patent-litigation cases involving NPEs grew
from 10% to 20% between 2000 and 2008. The
proportion of defendants whose cases involve
NPEs, which is disproportionately high,
rose from 22% to 36% in the same period (ref. 2;
see ‘Courting chaos’). Roughly two-thirds of
all suits are over software patents, but experts
warn that the life sciences are not immune.

COURTING CHAOS
US patent litigation involving ‘non-practicing
entities’ (NPEs) is rising (dashed line) as is the
proportion of defendants in NPE cases (solid line).

“There is definitely reason for concern in 
the biotech field,” says Julie Samuels, a staff
attorney at the non-profit Electronic Frontier 
Foundation in San Francisco, California. 

So far, Chien says, biotechnology companies 
have not been as vulnerable because their pat- 
ents are not as broad as software patents, and 
require more time and expertise to develop. 
Yet the Classen case shows that broad patent 
claims can survive in court. And some of the 
largest patent holders are showing signs that 
they are interested in the biotechnology sector. 
For example, Intellectual Ventures of Bellevue, 
Washington, founded by Nathan Myhrvold, a 
former Microsoft executive, owns hundreds of 
patents in the biotechnology industry. The com- 
pany says it has no plans to litigate in the field. 

A weak economy could also become a factor 
if it leads biotech companies to sell off their 
patents to opportunistic litigators. And if com- 
panies start profiting from troll behaviour, the 
practice will spread, says lawyer Robert Cook- 
Deegan of Duke University in Durham, North 
Carolina. “If people do it and get away with it, it 
sets up an incentive for other people to behave 
the same way,” he says.


1. Bessen, J. E. et al. Boston Univ. School of Law, Law and 
Economics Research Paper 11-45 (2011). 

2. Chien, C. V. North Carolina Law Review 87, 
1571-1615 (2009). 
Aboriginal genome analysis
comes to grips with ethics 


Sequencing effort provides a model for future studies of museum samples. 


BY EWEN CALLAWAY 


En route from Sydney to Perth, Australia,
in the early 1920s, British ethnologist
Alfred Cort Haddon acquired a tuft of
human hair from a young Aboriginal man. He 
added it to his sizeable collection of hair from 
people living around the world. 

Ninety years later, those locks have yielded 
the first complete genome sequence of an Abo- 
riginal Australian, and provided clues about 
the timing of human migrations from Africa 
to Asia (ref. 1; see ‘Early human explorers charted a
bold course’). The work has also underscored
the bioethical dilemmas involved in plumbing
the genomes of indigenous populations,
especially when the DNA comes from an
archived specimen such as Haddon’s. “To be
sequencing DNA from the hair of a deceased
indigenous person is uncharted ethical territory,”
says Emma Kowal, a cultural anthropologist
at the University of Melbourne.

The genome project, led by Eske Willerslev 
at the University of Copenhagen, received 
approval for the work from a group that rep- 
resents Aboriginals in the region in which the 
man probably lived. But some scientists are jit- 
tery about how others in the Aboriginal com- 
munity might receive the project, and worry 
that it could set back efforts to engage Aborigi- 
nals in genetic research. “In a sense, every Abo- 
riginal Australian has had something about 
themselves revealed to the world without their 
consent,” says Hank Greely, who directs the
Center for Law and the Biosciences at
Stanford University in California.

Today’s Australian Aboriginals are descended from the first humans to travel far from their African origins.

Aboriginal Australians endured centuries of repression
by European
colonists, but their wariness of genetic research
owes much to the Human Genome Diversity
Project (HGDP). This 1990s international
collaboration aimed to catalogue the genetic
diversity of populations worldwide, but sparked
concerns that indigenous peoples were being
subjected to neocolonial ‘bioprospecting’.
“Probably the strongest opposition we ran
into anywhere in the world” was in Australia,
says Greely, who was an ethical adviser to the
project. Plans to include Aboriginal Australian
DNA were eventually scrapped, and the furore’s
impact continues to reverberate, says Kowal.
“The damage that the HGDP has done for the 
prospect of doing genetic research with Abo- 
riginal people has been significant.” Research- 
ers who work with Aboriginal Australians are 
now expected to obtain consent not only from 
the individuals concerned, but also from local 
and sometimes state-wide groups representing 
Aboriginal communities across Australia. 

Sheila van Holst Pellekaan, a geneticist at the 
University of New South Wales in Sydney, says 
that she has encountered long delays in getting 
continued approval for her genetic studies of 
ancestry and disease among Aboriginal Aus- 
tralians, and can share only limited data with 
other researchers. Such obstacles, she says, 
mean that Aboriginals, who suffer from high 
rates of diabetes and heart disease, may miss 
out on advances in genetic medicine. 

Willerslev says that he was not fully aware of 
these issues when his team set out to sequence 
Haddon’s sample. Moreover, the sample came 
with little identifying information, so Will- 
erslev’s team had no idea whom to ask for per- 
mission to study it. 

A Danish bioethical review board did not 
believe it was necessary to review the project 
because it viewed the hair as an archaeological 
specimen and not a biological one, Willerslev
says. However, after his team sequenced the
genome, an Australian colleague put Will- 
erslev in touch with the Goldfields Land and 
Sea Council, a body based in Kalgoorlie, West- 
ern Australia, that represents the 5,000 or so 
Aboriginal Australians living in the region 
where Haddon collected the hair sample. In 
June, Willerslev flew to the region to describe 
his project to the organization's board and to 
seek its approval. He says that if the board had 
rejected his proposal, he would have ended the 
project and left the genome unpublished. 

Craig Muller, the research manager at 
Goldfields Land and Sea Council, says that he 
researched the history of the man’s hair sam- 
ple and concluded that the man had donated it 
freely to Haddon. “Once that was made clear, 
nobody was bothered that there might be some 
underlying problem,” says Muller. 

“I hope that our study will set a precedent on
how to potentially deal with these issues,” says 
Willerslev, whose team has obtained similar 
approval to sequence the genome of the iconic 
Native American Sitting Bull from a lock of hair.

Despite Willerslev’s efforts, “I would sug- 
gest there would be a certain amount of 
unrest in the indigenous communities”, says 
van Holst Pellekaan. Greely agrees that Will- 
erslev’s team should have reached out to other 
Aboriginal groups. 




“I think they did everything anyone could
reasonably expect them to,” counters Mark 
Stoneking, a molecular anthropologist at 
the Max Planck Institute for Evolutionary 
Anthropology in Leipzig, Germany. He published
a complementary analysis of Aboriginal
genomes last week (ref. 2), using DNA samples
obtained by other scientists with the consent 
of the Aboriginal Australian individuals 
involved. 

The study also raises broader consent issues 
over body parts of indigenous people held in 
museums, says Kowal. Many collections are 
returning bones to these groups, but the Brit- 
ish Museum in London, for instance, generally 
excludes hair and nails from its repatriation 
policy. Such specimens are a valuable tool for 
studying the genomes of people from around 
the world, including populations that no 
longer exist, argues Willerslev. 

Having a set of widely accepted guidelines 
for studying such samples would help to guide 
researchers, journals and funding agencies, says 
Stoneking. “Hopefully some sort of standards 
can be developed so everyone feels comfortable 
going ahead with this research,” he says.


1. Rasmussen, M. et al. Science http://dx.doi.org/10.1126/science.1211177 (2011).

2. Reich, D. et al. Am. J. Hum. Genet. http://dx.doi.org/10.1016/j.ajhg.2011.09.005 (2011).

3. Reich, D. et al. Nature 468, 1053-1060 (2010).
WAVES OF MIGRATION


Early human explorers charted a bold course 


The first complete genome of an Aboriginal 
Australian is a portrait of a pioneer. 
Comparisons with other genomes indicate 
that Australia’s original settlers parted 
company with other humans more than 
60,000 years ago, suggesting that they were 
among the first to venture far beyond Africa (ref. 1).
Like other populations outside Africa, 
the genome carries small chunks of DNA 
that originally came from Neanderthals. 
More surprisingly, it shows that Aboriginals’ 
ancestors also interbred with another 
archaic human population known as the 
Denisovans, only recently identified from 
30,000-50,000-year-old DNA recovered
from a finger bone found in a Siberian cave (ref. 3).
Until now, Papua New Guineans were the
only modern humans whose ancestors were 
known to have interbred with Denisovans. 

A separate study published last week, 
based on genetic markers in modern-day 
Aboriginal Australians, Asians and Pacific 
Islanders, paints a fuller picture of the 
Denisovans’ genetic legacy. The research, 
led by Mark Stoneking at the Max Planck
Institute for Evolutionary Anthropology in
Leipzig, Germany, revealed a patchwork
in which some populations, including
Australian Aboriginals, bore varying levels
of Denisovan DNA, whereas many of
their neighbours, such as the residents of
mainland Southeast Asia, contained none (ref. 2).

Stoneking says that this pattern suggests 
that an early wave of migration into Asia 
included the ancestors of contemporary 
Aboriginal Australians. This was followed 
by a second wave that gave rise to the 
current residents of mainland Asia. Some 
members of the first wave interbred with 
Denisovans. By the time the second wave 
of Asian migrants arrived, the Denisovans 
may have vanished. The study also suggests 
that the Denisovans’ range once extended to 
Southeast Asia and perhaps Oceania. 

“Put together, these two papers make an 
overwhelming case for multiple waves of 
migration,” says David Reich, a population
geneticist at Harvard Medical School in 
Boston, Massachusetts, and an author on 
the second study. E.C.
STIMULUS RESPONSE


The United States’ 2009 financial stimulus bill has provided research with breathing space,
rather than the sharp shot in the arm that many anticipated.


SPENDING A WINDFALL

In 2009, the US government allocated more than US$52 billion in stimulus
funds to research and development. Three agencies received most of the funds
for academic research. Much of the money has yet to be spent.


When Cathy Lord arrived in New York last month,
her US$1.2-million grant was about to run
out. The autism researcher had won the funding from the
National Institutes of Health (NIH) under the American Recovery and 
Reinvestment Act (ARRA), better known as the stimulus bill, and the 
money had helped to support her 15-strong team at the University of 
Michigan in Ann Arbor for two years. 

But the expiry of such a major grant didn't leave Lord’s team in the 
lurch — far from it. The grant was not like a standard NIH award. 
Instead of paying for experiments to test a hypothesis for publication 
— something that usually takes at least three years — the money allowed 
Lord's team to compile its existing knowledge, preparing a kit that other 
researchers can use to screen people for social and behavioural indica- 
tors of autism. In that sense, the grant was more of a bonus than a life- 
line. Now, Lord is about to open an autism clinic and research centre at 
NewYork-Presbyterian Hospital in White Plains, and about half of her
team have followed her there. For them, she says, the end of the ARRA 
money is no different from the usual rough and tumble that results from 
the coming and going of research grants. “If you find good people you 
just try and keep the ball rolling,” she says. 

Such a smooth transition is not what many expected for scientists 
facing the end of a stimulus grant. When the bill, now estimated
to be worth $840 billion, was signed into law by
President Barack Obama in February 2009, it contained 
more than $52 billion for research and development (see ‘Spending 
a windfall’), of which $15 billion was expressly for scientific research 
at the NIH, the National Science Foundation and the Department 
of Energy’s Office of Science (see Nature 461, 856-857; 2009). The 
stimulus package as a whole was designed to create jobs and ease the 
pain of the recession, and at first the administration pledged to get this 
money distributed and spent as quickly as possible — the NIH, for 
example, devoted much of its allocation to two-year grants that were 
meant to run from October 2009 to this month. So even as scientists 
scrambled to get a share of the windfall, some universities and science 
lobby groups were warning of a ‘funding cliff’ when the money — 
which would have boosted total NIH funding by about one-sixth if it 
was spent over two years — came to an abrupt end. 


THE CLIFF THAT WASN'T 

From the beginning, however, the funding pulse didn’t have quite the 
anticipated effects. Duke University in Durham, North Carolina, for 
example, expected a hiring rush after it attracted $210 million in ARRA 
funds, making it one of the ten most successful universities in the coun- 
try in this regard, says Jim Siedow, Duke's vice-provost for research. But 
staffing barely budged. “We gave a party that nobody came to,” he says.
“A lot of people used the money to keep the people they already had.”

And despite the early political pressure to get the money out of the 
door quickly, agencies have allowed funds to be released gradually, to 
avoid waste. The NIH quietly relented earlier this year and is permitting 
‘no-cost extensions’ that allow team leaders, including Lord, to defer 
spending and wind down their projects neatly. The agency has now 
spent $7 billion of its $10.4-billion allocation, and Sally Rockey, the 
NIH’s deputy director of extramural research, says that “a big chunk of 
it will still be ongoing” until 2013. 

The result, at Duke as elsewhere, has been that the ARRA funding 
will help to sustain research departments for at least another two years. 
Siedow says that a detailed internal analysis suggests that Duke will lose 
only “a very small number” of academic positions as the funding winds 
down. “We'd been concerned that funding would fall off a cliff, but so
far, it hasn't happened,” he says.


MEASURED RECOVERY 

Other research funding agencies report a similarly gradual tail-off. The 
Department of Energy's Office of Science says that it expects substantial 
spending to continue for some time — it expects to disburse about $260 
million in fiscal year 2012 and $170 million in fiscal year 2013. The NSF,
which was under less pressure than the NIH to rush the money out, has 
so far distributed most of it to previously unsuccessful applicants for 
standard 3-5-year NSF grants. Only $1.4 billion of the agency’s ARRA 
allocation has been spent so far. 

Despite this stretching out and morphing, the ARRA is set to leave 
a distinct legacy in a few areas. One of these is metrics. Kei Koizumi, 
assistant director for federal research and development at the White 
House Office of Science and Technology Policy, says that ARRA fund- 
ing is “the most extensively documented set of research investments 
we've ever seen”.

Because of the initial expectation that stimulus funding would 
generate jobs, its recipients are required to report regularly on the 
employment impact of their funding. Their returns are compiled and 
posted immediately on a public website (www.recovery.gov); 
these responses suggest that the NIH ARRA funding, 
for example, is currently supporting the equivalent of 
about 20,000 full-time positions. However, such self-reported
data are not always accurate.

The passage of the stimulus bill also gave birth 
to Star Metrics, an ambitious cross-government 
programme set up to monitor research inputs and 
outputs including, in its first phase, jobs (see Nature 
465, 682-684; 2010). Star Metrics started off track- 
ing the employment impact of ARRA and non-ARRA 
research spending at about 70 volunteer research uni- 
versities, representing an estimated 40% of the US univer- 
sity research system. The results of this analysis, which takes 
its data directly from university payroll systems, are set to be published 
shortly. 

But further detail about the impact of ARRA research spending may 
be slow to emerge. Congressional committees have shown little inter- 
est so far in raking over the data, according to research lobbyists and 
university leaders. “Priorities change very rapidly now,” says Chris King,
a staff member with the House of Representatives science committee.
Formal programme evaluation, as envisaged by Star Metrics, is just “not
very critical for people on Capitol Hill”, he says.

In the meantime, Star Metrics has shifted its focus away from the
ARRA, towards the impacts of all research. “What has happened is
that the notion of accountability has spread out much more broadly,”
says Julia Lane, an economist and the NSF official who leads Star Metrics.
The programme is planning a second stage, in which every researcher in
the United States who receives federal funds is assigned an ‘identifier’,
opening the road to a national database that keeps tabs on everything
from total federal funding and employment to publications, citations
and patent applications.

NATURE.COM: For a breakdown of the 2009 stimulus spending, see: go.nature.com/xlxeel

Although much of the ARRA research money has gone on grants, 
assumed to be the fastest way to spend the money, some of it is build- 
ing a concrete legacy in the shape of investment in bricks, mortar and 
instrumentation. These include a handful of stand-alone projects — 
such as a $16-million lab-animal facility at Duke — and many injections 
into projects already under way. 

One of the largest such investments is the $150 million being spent 
by the Department of Energy to speed up construction of the $1-billion 
National Synchrotron Light Source II: a high-energy photon source at 
Brookhaven National Laboratory on Long Island, New York, that will 
enable materials scientists and structural biologists to look at molecules 
and crystals at the nanoscale. The project was approved in 2005, but the 
energy department lacked the funds to build it quickly; the facility is 
now expected to be finished in 2014. “ARRA enabled us to pull forward 
the construction work, and we'll have a more fully outfitted facility, with 
better beamlines and laboratories,” says Steve Dierker, head of photon 
science at Brookhaven. The facility will be 10,000 times brighter than 
the light source that it replaces, says Dierker. 

The stimulus has also funded experiments in approaches to research 
funding. Some of these set out specifically to support ‘high-risk’
research projects that might struggle to attract support through the 
normal grant peer-review process — most notably at the Department of 
Energy's new branch, the Advanced Research Projects Agency—Energy 
(ARPA-E), which was conceived in 2004 but got funded, to the tune of 
$400 million, only under ARRA. ARPA-E has been backing research 
in untested areas, such as the development of microbes engineered to 
produce biofuels. 

More commonly, as at the NIH, ARRA has forced a compression of 
the peer-review process, and resulted in grants that last for less time 
than normal. Some of these, such as Lord’s, allow research groups to 
expand or wrap up projects, but not necessarily test a new hypothesis. 
“We did an experiment with two-year grants, and we'll follow up to see
how that worked,” says Rockey.

One of the biggest legacies of the science stimulus, however,
will be that thousands of students have stayed in
science longer than they might otherwise have done.
“ARRA has provided postdocs and graduate students
with great training that'll help them with the rest of their
careers,” says Rockey. “These people will move on to do productive
and wonderful things.”

Another legacy will be the ideas, large and small, that have been ger- 
minated by ARRA funding — even if their creators are uncertain that 
they can bring them to fruition. Dan Gauthier, a physicist at Duke, used 
his three-year, $480,000 NSF grant to study the interaction of laser light 
with very cold rubidium atoms. Gauthier is delighted to have had the 
chance to pursue a project that would otherwise have been out of reach; 
he thinks that his findings could eventually help in the development of 
quantum communication networks. 

At the same time, he worries about next summer, when the grant supporting
him and his two research assistants will end. To keep his team
intact, he will have to win normal funding from the NSF. “If we don't
get funded, it'll just be devastating,” he says. “This was all predicated
on the idea that the economy would get better after two or three years:
now we don't know that it will.”


Colin Macilwain is a freelance writer based in Edinburgh, UK. 
4 WAYS TO FIX THE CLINICAL TRIAL


Clinical trials are crumbling under modern 
economic and scientific pressures. Nature 
looks at ways they might be saved. 


BY HEIDI LEDFORD 


Developing a drug is a costly gamble.
Getting one to market takes, on
average, more than ten years and a
billion dollars. About 85% of therapies
fail in early clinical trials. And of those
that survive through to phase III, generally the
last step before regulatory approval, only half
will actually be approved.

Although a promising compound can 
fail for many reasons, from safety concerns 
to corporate decisions, many say that a sig- 
nificant number of good drugs are being lost 
to outdated and impractical clinical-trial 
designs (see ‘The clinical-trial cliff’). The 
drugs may work, says Lillian Siu, an oncolo- 
gist at Princess Margaret Hospital in Toronto, 
Canada, “we just don't know how to test them 
appropriately.”

Solving the problem may require 
fundamental changes to the clinical-trial
system to make it faster, cheaper, more adapt- 
able and more in tune with modern molecular 
medicine. The old paradigm, established in 
the 1960s, was based on single trials, carried 
out at single sites, and designed to answer a 
single question, says Rachel Sherman, associ- 
ate director for medical policy at the Center 
for Drug Evaluation and Research at the US 
Food and Drug Administration (FDA) in 
Bethesda, Maryland. “But that’s not the world 
we're living in now.”

Today, the world is more risk-averse, and 
demands larger trials to pinpoint safety con- 
cerns. Compounds that confer only small 
benefits when compared with existing drugs 
require large sample sizes for the results to 
be statistically significant. As a result, tri- 
als have become bigger, and often occur at 
multiple sites, even in multiple countries, and
can involve thousands of personnel. 

The long-heralded era of personalized 
medicine — tailoring treatments or com- 
binations of treatments to a specific patient 
— adds its own complications. Such an 
approach has the potential to lower the fail- 
ure rates of investigational drugs by testing 
them only in the individuals most likely to 
benefit. But research teams struggle to iden- 
tify biological markers that can be used to 
stratify patients by the characteristics of their 
disease, and when they do, diseases can get 
splintered into rare subtypes that each affect 
just a few individuals. This means that 
researchers must screen a much larger pool 
of potential participants. 

Some researchers are working to improve 
the fortunes of potential therapeutics, however. 
Here are four ways that they have come up with 
to give clinical trials a better success rate. 


RECRUIT EARLY 

Patient recruitment has been
a major stumbling block.
At least 90% of trials are
extended by at least 6 weeks
because investigators fail to
enrol patients on schedule. Only about one-third
of the sites engaged in any multicentre
study ever manage to enrol the requisite number,
says Kenneth Getz, an expert on clinical
research at the Tufts Center for the Study of
Drug Development in Boston, Massachusetts.
The result: longer, more expensive trials — 
some of which may never be completed. 

Personalized medicine, which has moved 
apace in cancer research and development, 
exacerbates the recruitment problem. George 
Sledge, an oncologist at Indiana University in 
Indianapolis, offers the example of a class of 
drug that inhibits enzymes called kinases. 
Imagine a trial that targets two kinases, one 
of which is mutated in 25% of patients, and 
the other of which is mutated in 8%. “You'd
have to screen about 50 patients to find one
eligible for the trial,” he says. For more complex
combinations and more diverse patient
pools, the problem “gets to be virtually insoluble”,
he says.
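
Sledge's estimate can be checked with a few lines of arithmetic. The sketch below assumes, beyond what the article states, that a patient must carry both mutations to be eligible and that the two mutations occur independently; the function name `expected_screened` is illustrative only.

```python
def expected_screened(prevalences):
    """Expected number of patients screened per eligible patient.

    Assumes (hypothetically) that every listed marker must be present
    and that the markers occur independently of one another.
    """
    p_eligible = 1.0
    for p in prevalences:
        p_eligible *= p          # joint probability of carrying all markers
    return 1.0 / p_eligible      # mean of a geometric waiting time


# Two kinases, mutated in 25% and 8% of patients respectively
print(round(expected_screened([0.25, 0.08])))  # 50
```

Adding a third marker with, say, 10% prevalence would push the figure to about 500 under the same assumptions, which shows how quickly more complex combinations shrink the eligible pool.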

Researchers and patient advocates are try- 
ing to make it easier to find eligible volunteers. 
They are taking a page from organizations such 
as the Alpha-1 Foundation in Miami, Florida, 
which has created a registry of patients with 
alpha-1 antitrypsin deficiency, a disorder that 
makes them particularly susceptible to lung 
and liver diseases, who are willing to be con- 
tacted about clinical trials. 

The Moffitt Cancer Center in Tampa, Florida,
for example, runs a Total Cancer Care
programme that unites 18 hospitals, compiling
medical history, tissue samples and genetic
information about each patient’s tumour.
Samples are all stored for future analysis, and
patients can consent to doctors contacting
them about trials. A similar union of four
institutions is in the works in the Netherlands,
says Jan Schellens, an oncologist and pharmacologist
at the Netherlands Cancer Institute in
Amsterdam.

Drug companies are removing more compounds from the pipeline at all levels of testing than ever before.

Negotiating agreements across institutions
can be tricky. “Each brings its own values, preferences
and interpretation of the privacy laws
to the table,” says Walter Kernan, a neurologist
at Yale University in New Haven, Connecticut.
“To set up a system in which a hospital provides
investigators at another institution with
names of patients and diagnoses requires enormous
trust.”

Several groups that have been talking about 
forming networks are likely to face technical 
barriers. They need to develop appropriate 
patient-consent forms, unified databases and 
ultra-secure networks to connect hospitals. 
“Many of these groups don't have anywhere
near the information-technology networks
you'd need to carry this off,” says Sledge. But
that doesn’t mean it won't happen, he adds. “It's
the wave of the future, but it’s not going to be
simple.”


SKIP ANIMALS 
In 2009, researchers at the 
National Cancer Institute 
(NCI) in Bethesda faced a 
dilemma. They had evidence 
that blocking a protein called 
AKT, involved in cell death and proliferation, 
could stave off cancer, but initial animal studies 
on a compound that blocked the protein sug- 
gested that the drug was poorly absorbed by 
the human body. So researchers developed five 
formulations of the drug. The question was, 
should they test each one in animals, or could 
they skip ahead and evaluate them in humans? 
In 2006, the FDA and the European Medi- 
cines Agency (EMA) introduced guidelines 
for testing very small ‘microdoses’ of drugs
in humans. These are concentrations less
than one-hundredth of the therapeutic
dose. Because the concentrations are so low, 
the drugs can be tested in a small number of 
patients without the level of safety data nor- 
mally required before a phase I study. These 
early tests, dubbed ‘phase 0’ studies, would 
show, for instance, how the drug is distributed 
and broken down in the body, and whether it 
hits the right molecular target. 

Paul Limburg, a gastroenterologist at the 
Mayo Clinic in Rochester, Minnesota, decided 
to launch a phase 0 study of the new AKT drug 
formulations. The results, published in March, 
allowed the team to pick the one absorbed best 
by the body for use in future trials (J. M. Reid 
et al. Cancer Prev. Res. 4, 347-353; 2011). 

Proponents of phase 0 testing argue that it
makes sense to get human data quickly. About
one-quarter of the molecules entering clinical
trials fail because of “poor pharmacology”,
says Schellens. The drug may not be readily
absorbed, for example, or may not reach its
target organ. With a simple test in humans, he
says, “you could kill that drug much earlier,
before you have invested so much time and
money”.

But Razelle Kurzrock at the MD Anderson 
Cancer Center in Houston, Texas, is concerned 
that developers will kill promising drugs in 
response to negative microdose data, even 
though the drug may work at higher, thera- 
peutic concentrations. To test this hypothesis, 
she and her colleagues are conducting phase 0 
trials with FDA-approved drugs to find out 
whether any would have failed at that stage. 

And phase 0 trials may be tiny, but they are 
not easy. They require a test sensitive enough 
to detect the minute quantities of the drug in 
the body — and possibly also ways to track 
its mechanism of action. For veliparib, a
potential anti-cancer drug, Abbott Laboratories,
headquartered in Abbott Park, Illinois,
started developing assays that were robust 
and sensitive enough to detect tiny concen- 
trations of the drug more than a year before 
its researchers embarked on a phase 0 trial, 
says James Doroshow, the NCI oncologist who 
led the trial. 

Few, says Schellens, plan so far ahead. “Often 
when people start thinking about the phase 0 
trial in the drug development process, they are 
too late.”


USE MODELS 

Although phase 0 trials could
help to wean researchers off
pharmacology studies in animals,
moves are also afoot
to bring mouse experiments
closer to the clinic.

The problem with most of today’s animal 
studies, says Eric Holland, a neurosurgeon 
at the Memorial Sloan-Kettering Cancer 
Center in New York, is that they typically 
focus on safety rather than efficacy. And they 
rarely predict exactly what will happen in the 
clinic, because the doses, formulations and 
schedules of medication differ from those 
given to the animals. A person with prostate 
cancer, for example, does not usually take an 
experimental medication until the standard 
hormone therapy has failed. But the mice that 
experimental drug was tested on are unlikely 
to have received the same treatment. 

Pier Paolo Pandolfi, a cancer researcher
at Harvard Medical School in Boston, has
therefore pioneered a technique called the
‘co-clinical trial’, in which mice with similar
disease characteristics are treated in a similar
way to the humans. Holland recently used
the approach to test combinations of drugs
that target AKT and another protein, called
mTOR (K. L. Pitter et al. PLoS ONE 6, e14545;
2011). The mouse trial allowed the researchers
to try more concentrations than the human
trial, and suggested new ways of screening
patients for inclusion in future clinical trials.
“The more complicated the trial, the more
likely it is that the co-clinical strategy would
be beneficial,” Holland says.

In 2009, the NCI invested US$4.2 million
in Pandolfi’s co-clinical trials in prostate
and lung cancer. In them, mice receive the
same therapies as people do before the start
of experimental treatments. At The Jackson
Laboratory, based in Bar Harbor, Maine,
which has a large personalized medicine programme,
researchers graft tumour cells taken
from patients into mice. These mice can then
be given the same treatment as the patient,
so that researchers can track how the tumour
cells respond, and use this information to tailor
future treatment for the patient.


ALTER COURSE
Perhaps nowhere in science
is ignorance prized as highly
as in clinical trials. To keep
expectations or biases from
inadvertently influencing
the results, patients and investigators are often
kept in the dark about who is receiving what
treatment until the end of the study. At best,
an external committee may take a secret peek
at the results mid-trial to make sure it is safe
for the experiment to continue.

But looking just once at the data is “like
driving home from work and only opening
your eyes once to see where you're going”, says
Donald Berry, a statistician at the MD Anderson
Cancer Center.

Berry specializes in designing ‘adaptive’ trials,
which can change course as the data roll
in. If one treatment regimen seems to be more
successful, for example, researchers might
increase the proportion of participants that
should receive that treatment. These trials can
also be used to identify biological markers,
such as mutations or altered metabolite levels,
associated with the success or failure of a given
regimen.

For example, before women enrol in an
ongoing adaptive trial called I-SPY2, run by
Berry and Laura Esserman, an oncologist at
the University of California, San Francisco,
their breast-cancer tumours are biopsied and
tested for genetic markers. The genetic make-up
of their tumours determines whether they
are eligible to enter the trial, and which group of
treatments they should be randomized to. But
because the results are reanalysed each time a
woman completes her treatment, the participants
are more likely to receive a treatment that
has worked in people with genetically similar
tumours. If a treatment does not perform well
in any patients, it is cut from the programme;
other treatments can then be added to the mix.
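
As a rough illustration of how an adaptive design can shift participants towards a better-performing regimen, the sketch below uses Thompson sampling, a standard response-adaptive allocation rule. It is not the algorithm used in I-SPY2; the two response rates, the 200-patient size and the function name `adaptive_trial` are hypothetical choices made for the example.

```python
import random


def adaptive_trial(true_response_rates, n_patients, seed=0):
    """Simulate response-adaptive allocation with Thompson sampling.

    Each arm keeps a Beta(successes + 1, failures + 1) posterior over its
    response rate. For every new patient, one value is drawn from each
    posterior and the patient is assigned to the arm with the highest draw.
    Returns the number of patients allocated to each arm.
    """
    rng = random.Random(seed)
    arms = len(true_response_rates)
    successes = [0] * arms
    failures = [0] * arms
    for _ in range(n_patients):
        draws = [rng.betavariate(successes[a] + 1, failures[a] + 1)
                 for a in range(arms)]
        arm = draws.index(max(draws))
        # Simulated outcome; in a real trial this is the observed response
        if rng.random() < true_response_rates[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return [s + f for s, f in zip(successes, failures)]


# Hypothetical regimens with 20% and 60% response rates
print(adaptive_trial([0.2, 0.6], n_patients=200))
```

Because the allocation is updated after every outcome, most simulated participants end up on the regimen that is performing better, which is the behaviour the article describes.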
The FDA and the EMA have encouraged drug
developers to embrace these designs. In addition
to high-profile trials such as I-SPY2, Berry says
that he has designed trials running the whole
gamut of disease, from diabetes to pandemic flu.
“Virtually every pharma company is doing this,”
he says. But Getz disagrees and says that he's seen
a more muted response to adaptive trials. “Our
clinical-trial protocols have become too complex
across the board,” says Getz. “And adaptive
designs add to the logistical complexity.”
One issue is statistical: the more tests on the
data a researcher conducts, the more likely it is 
that they will introduce false positive results. 
Scott Evans, a biostatistician at the Harvard 
School of Public Health in Boston, is more 
concerned about bias, however. Even if it is 
an independent panel that views the data and 
adapts the trial, it is often impossible to hide 
a change in protocol from everyone. “Any in-flight
adjustment you make in a clinical trial is
potentially observable,” he says. “Now you've
created the potential for an operational bias.”
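
The statistical issue raised at the start of this passage can be made concrete with the family-wise error rate. The sketch below assumes that every look at the data is an independent test at the same significance threshold, which is a simplification: interim analyses of one trial are correlated, and real designs apply corrections. The 0.05 threshold and the function name are illustrative.

```python
def familywise_error(alpha, looks):
    """Chance of at least one false positive across `looks` independent
    tests, each run at significance level `alpha`."""
    return 1.0 - (1.0 - alpha) ** looks


for looks in (1, 5, 10):
    print(looks, round(familywise_error(0.05, looks), 3))
```

Under these assumptions, the chance of at least one spurious finding rises from 5% with a single test to roughly 40% with ten.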

“There has been a lot of controversy about 
adaptive trials,’ Siu says. “But as the molecular 
era arrives, they will become more and more 
relevant?” 


TEAR DOWN OR TWEAK? 
While researchers dream up new, better 
ways to design clinical trials, many involved 
acknowledge that the changes with the biggest 
impact will probably be more bureaucratic 
than conceptual. 

Simply standardizing the forms used to 
record clinical-trial data would reduce costs 
and cut down on record-keeping errors and 
omissions. The NCI is now rolling out a data- 
management system that will standardize data 
entry across all 2,000 sites that conduct NCI- 
sponsored trials, says Doroshow. The FDA is 
also looking at ways to cut down on reporting 
requirements and paperwork, so that investi- 
gators can submit summaries of case reports 
rather than each individual document. 

Trials that involve multiple sites also have 
to get approval from each institution’s review
committee, referred to as an institutional review 
board. This can take months or even years. 

To adapt to the multicentre climate, the US 
Office for Human Research Protections in 
Rockville, Maryland, which oversees human 
studies funded by the US National Institutes 
of Health, has proposed changes to its guide- 
lines that would require designation of a sin- 
gle review board for each project. However, 
individual research centres may be reluctant 
to loosen their hold on the reins. 

The clinical-trial system may be outdated, but 
at least it is not inflexible. Most radical designs 
have probably been tried in some capacity. 
Getting rid of control groups? It happens in 
rare-disease trials. Blending phases? Some 
trials are already starting to do this. Still, even 
small tweaks require a leap of faith when they 
are being made to a complex system with such 
high stakes — the protection of human health. 

The way that clinical trials are designed and 
run may need to be revolutionized, but “there's 
not going to be a single solution because it’s a 
multifactorial process,” says Richard Schilsky, 
an oncologist at the University of Chicago, Illinois.
“Everybody who is a stakeholder in the
clinical-trial process has to contribute to the
solutions.”


Heidi Ledford writes for Nature from Boston. 


ILLUSTRATIONS BY JONATHAN BURTON 


COMMENT


Why governments should measure well-being and promote the good life p.532
New tools to address old genomic questions p.534
Icelandic singer Björk combines music and nature in an app p.537
Universities should spend less on bureaucracy and more on researchers p.538


Fund people not projects 


John P. A. Ioannidis proposes ways to save scientists from 
spending all their time writing grants. 


The research funding system is broken:
scientists don’t have time for science
any more. Because they are judged
on the amount of money they bring to their
institutions, writing, reviewing and administering
grants absorb their efforts. The
requirement that they promise taxpayers
specific results to justify research tends to
invite either exaggeration or boringly predictable
projects. Yet the research behind
30% of the pivotal papers from Nobel laureates
in medicine, physics and chemistry
was done without direct funding.

Every scientist recognizes this problem
and hopes for a solution. Although detailed
proposals may be indispensable for some 
projects, such as rigorous clinical trials and 
large-scale collaborative research, ideas 
abound for more efficient ways to fund 
general research. Some organizations are 
already experimenting. Multiple options 
could co-exist, with portions of the budget 
earmarked for different schemes. 

Here are some of the most promis- 
ing proposals to reduce the amount of 
time scientists spend trying to fund their 
research, and the pros and cons of each (see 
table). Definitive fixes would require major
system overhauls, which are likely to make
some scientists justifiably nervous. But 
smaller, pilot efforts that enable us to evalu- 
ate what works could begin right away. 


FUND EVERYBODY (OR A LUCKY FEW) 

Some — or all — of the research budget 
could be allocated to eligible scientists in 
equal shares, or given to a few lucky ones 
at random. With egalitarian sharing, each 
scientist would receive only a small amount, 
which could quickly evaporate without 
returns when research costs are high. But 
scientists in some fields, such as mathematics,
could achieve much on a small
share; and in some settings the shares could 
be substantial. For instance, if half of the 
US$31.2 billion that the US National Insti- 
tutes of Health spends on research each year 
was shared equally among 300,000 research- 
ers, each would get more than $50,000 a year. 
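
The per-researcher figure quoted above follows directly from the numbers in the text. The short sketch below simply reproduces that division; the function name `equal_share` is illustrative.

```python
def equal_share(total_budget, fraction_shared, researchers):
    """Annual amount each researcher receives if a fraction of the
    budget is split equally among all eligible researchers."""
    return total_budget * fraction_shared / researchers


# Half of US$31.2 billion shared among 300,000 researchers
print(equal_share(31.2e9, 0.5, 300_000))  # 52000.0
```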

Lottery distribution, too, flies against the 
principle that science funding should be 
meritocratic. Still, some agencies are trying 
it — the Foundational Questions Institute 
in New York, which tackles key questions 
in physics and cosmology, uses a lottery sys- 
tem to award its mini-grants, which range 
in value from $1,000 to $15,000. Such an 
approach may not be as radical as it sounds: 
the imperfections of peer review mean that 
as many as one-third of current grants are 
effectively being awarded at random. This
situation will only worsen as falling accept- 
ance rates encourage investigators to bom- 
bard agencies with proposals, leaving fewer 
qualified reviewers to judge each one. The 
downside of using aleatoric allocation is that 
not every deserving scientist will be funded. 


FUND ACCORDING TO MERIT 

Leading thinkers and experimenters worthy 
of unconditional support could be identi- 
fied through peer assessment of their work 
and credentials. Appraisals of project-based 
proposals already take a scientist’s merit into 
account, but they typically give less weight to 
it than to the project plan. Peer assessment 
does not work well for early-career scientists, 
who have a short track record. But for those 
more established in their field, a career trajec- 
tory offers a wealth of information. By con- 
trast, an isolated project is only a snapshot. 


OPTIONS FOR REVAMPING THE FUNDING SYSTEM 


Option Pros 


Egalitarian Avoids peer-review biases 


(fund everybody) Gives sufficient amounts to 
scientists doing low-cost research 
Small administrative burden 

Aleatoric Avoids peer-review biases 


(fund at random) 


Assessment of 
career 


Captures career trajectory 
Has gold-standard status 


Automated impact Eliminates favouritism 
indices 
ease 

Approaches objectivity 


Scientific citizenship 


ones penalized 


Projects with 


broad goals review 


Permits targeted innovation 


Small administrative burden 


Evaluates many applicants with 


May improve science, if good 
practices are rewarded and bad 


Proposals are easy to write and 


Formulating work can be flexible 


In the MacArthur Fellows Program, 
for example, meticulous peer assessment 
is used to select 20-30 individuals a year 
on the basis of exceptional creativity and 
promise for important advances. Recipi- 
ents do not have to justify what they do 
with the $500,000 award, which is spread 
over 5 years. However, close scrutiny of an 
individual's career may become prohibitive 
for systems that award thousands of grants 
— it might save grant recipients time, but it 
adds to the administrative load of reviewers. 
The approach is also vulnerable to favouritism, in which only elite 
individuals and lines of research are selected and thousands of 
scientists doing quality, smaller-scale science are left out. 

To avoid the subjectivity and burden of evaluating thousands of 
careers, an automated system to evaluate relative merit would have to 
be devised. Such a system would depend on objective indices. The share 
of the annual funding budget scientists receive would be based on 
their value, calculated with a pre-specified formula. 
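
As a rough illustration of what a pre-specified formula could look like, the sketch below splits a fixed annual budget in proportion to each scientist's merit value. The proportional rule, the function name `allocate_budget` and the example values are assumptions made for illustration only; the text does not specify any particular formula.

```python
from typing import Dict


def allocate_budget(values: Dict[str, float], budget: float) -> Dict[str, float]:
    """Split a fixed budget in proportion to each scientist's merit value.

    Hypothetical rule: share_i = budget * value_i / sum(values).
    """
    if any(v < 0 for v in values.values()):
        raise ValueError("merit values must be non-negative")
    total = sum(values.values())
    if total <= 0:
        raise ValueError("at least one scientist must have a positive value")
    return {name: budget * v / total for name, v in values.items()}


# Illustrative values only: scientist A is rated three times as highly as B or C.
shares = allocate_budget({"A": 3.0, "B": 1.0, "C": 1.0}, budget=100_000.0)
```

Under this rule every scientist with a positive value receives something, so the choice of formula also decides how egalitarian or elitist the resulting system is.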

Metric-based appraisals are familiar to many scientists already, 
particularly those in European countries. The UK Research Assessment 
Exercise, for example, relies on them. It is a much hated and debated 
system for evaluating departments, but its replacement, the Research 
Excellence Framework, will rely even more on indices when it comes 
into effect in 2014. Metrics also underlie many hiring decisions in 
Italy, a country that is struggling to remedy widespread nepotism, and 
in Germany’s Max Planck institutes. However, most of these assessments 
are simplistic, focusing on number of peer-reviewed publications, or 
inappropriate, looking at the impact factor of the journal rather than 
of an individual article. More sophisticated formulae are needed if 
a scientist’s merit is to be captured. 

Option: Egalitarian (fund everybody) 
Cons: Does not support large research efforts. Does not recognize exceptional scientists. 

Option: Aleatoric (fund at random) 
Cons: Will not capture all deserving scientists. 

Option: Assessment of career 
Cons: Is vulnerable to favouritism. Inappropriate for young researchers. Is labour-intensive. 

Option: Automated impact indices 
Cons: There are many indices, all with flaws; no consensus about best one to use. Indices can be gamed. Databases have shortcomings (such as imperfect citation coverage, entry errors, name disambiguation problems). 

Option: Scientific citizenship 
Cons: Automation is not yet possible for data gathering, and is difficult for some citizenship practices. Has peer-review biases. 

Option: Projects with broad goals 
Cons: Does not eliminate project proposals. Is vulnerable to favouritism. Holds potential for exaggerated promises and claims. 

Furthermore, indices are open to 
gaming, although some are more difficult 
to influence than others. To counter this, 
the system could use indices that exclude 
self-citations and capture quality rather 
than quantity (such as average citations 
per paper instead of number of papers), 
discourage gift authorship by adjusting for 
co-authors and penalize quantity that is not 
accompanied by quality. Several metrics 
could be combined. 
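
One way to picture such an index is the sketch below, which excludes self-citations, divides credit among co-authors and averages over papers, so that adding uncited papers lowers the score. The `Paper` record, the equal-split rule for co-authors and the name `merit_score` are assumptions for illustration, not an established metric.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Paper:
    citations: int       # total citations received
    self_citations: int  # citations coming from the scientist's own papers
    n_authors: int       # number of authors, including the scientist


def merit_score(papers: List[Paper]) -> float:
    """Average per-paper citation credit for one scientist.

    Self-citations are excluded, and each paper's credit is divided
    equally among its authors (an assumed way to discount gift authorship).
    """
    if not papers:
        return 0.0
    credits = []
    for p in papers:
        if p.n_authors < 1:
            raise ValueError("a paper needs at least one author")
        external = max(p.citations - p.self_citations, 0)
        credits.append(external / p.n_authors)
    # Averaging (rather than summing) penalizes quantity without quality.
    return sum(credits) / len(papers)


# Illustrative numbers only.
focused = [Paper(40, 4, 3), Paper(10, 0, 1)]
padded = focused + [Paper(0, 0, 1)]
```

With these example numbers the two-paper record scores higher than the same record padded with an uncited third paper, which is the behaviour the paragraph above asks for.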

Funding systems could reward good 
scientific citizenship practices, such as 
data sharing⁴, high-quality methods, careful 
study design and meticulous reporting 
of scientific work⁵. Openness to collaboration, 
non-selective publication of ‘negative’ 
findings, balanced discussion of limitations 
in articles and high-quality contributions 
to peer-review, mentoring, blogging or 
database curation could also be encour- 
aged. Researchers might be rewarded for 
publishing reproducible data, protocols 
and algorithms. However, some citizenship 
practices are difficult to capture in auto- 
mated databases, so would be subject to the 
disadvantages of peer assessment. 


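Option: Egalitarian (fund everybody) 
Example: Some universities fund the salaries of all their faculty 
Who would be funded? All 

Option: Aleatoric (fund at random) 
Example: Foundational Questions Institute 
Who would be funded? Flexible 

Option: Assessment of career 
Example: MacArthur Fellows Program 
Who would be funded? Few elite scientists (or else administratively burdensome) 

Option: Automated impact indices 
Example: UK Research Excellence Framework 
Who would be funded? Flexible 

Option: Scientific citizenship 
Example: Financial incentives to peer reviewers 
Who would be funded? Could be extended to many scientists only for aspects that can be automated 

Option: Projects with broad goals 
Example: NIH Director's Pioneer Awards; Howard Hughes Medical Institute 
Who would be funded? Few elite scientists 

Two or more options can also be combined (for example, automated impact indices plus evaluation of scientific citizenship). 
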
STATE BROAD GOALS 


Another way to save time would be to sim- 
plify the application. Researchers could be 
asked, for example, to submit short summa- 
ries of their intended research, describing 
broad goals only. Such applications require 
less effort to write, review and administer 
and would allow flexibility in carrying out 
the work, if funded. Examples include the 
NIH Director’s Pioneer Awards and the 
Howard Hughes Medical Institute (HHMI) 
grants. The HHMI selects 300 established 
investigators and 50 young investigators 
through peer assessment of their creden- 
tials and of proposals for high-risk, uncer- 
tain prospects of innovation. Its awards are 
usually renewed after 5 years, requiring 
simple documentation of effort rather than 
demonstrated results, although results are 
needed for renewal after 10 years. Never- 
theless, any system that demands high-risk 
innovative goals, and requires results, gen- 
erates potential for exaggeration. 


IGNORE GRANT PORTFOLIOS IN PROMOTIONS 

Many institutions use the size of a scientist's 
grant portfolio as a basis for tenure and pro- 
motion. This practice may prompt scientists 
to prepare multiple grant applications focus- 
ing on expensive, even if dull, projects and 
to abandon brilliant ideas that need limited 
funding to test. But going after many and 
expensive grants costs institutions money, 
because both scientists and administrators 
spend an inordinate amount of time evalu- 
ating proposals, supplements, timesheets 
and justifications, as well as progress and 
final reports. 

Many large projects never result in a 
scientific achievement, so even if the strategy 
brings in short-term grant funding, it may 
not pay off in the long term. The size of a 
portfolio should therefore not be a criterion 
for promotion; committees should focus 
instead on real work and achievements. 
Judging scientists by the size of their portfo- 
lio is equivalent to judging art by how much 
money was spent on paint and brushes, 
rather than the quality of the paintings. 


WHAT WE CAN DO NOW 

All of the options above could be achieved 
— either through small, progressive steps, 
or through more extensive changes to the 
system. A major overhaul is likely to take 
years to implement, and will meet with wide 
resistance and debate. Smaller steps, such as 
changing promotion and tenure criteria, 
are easier to make. Pilot programmes of 
proposal-free or broad-goal-based funding 
can be incorporated into existing funding 
structures. 


There are issues still to be resolved. All 
funding options face a tension over how 
many scientists should receive awards, and 
there is no good evidence on whether it is 
better to give fewer scientists more money 
or to distribute smaller amounts between 
more researchers. Some funding schemes 
are well suited to funding numerous scien- 
tists; others favour elitism (see table). 

We will need to find ways to figure out 
which approach works best. It is a scandal 
that billions of dollars are spent on research 
without knowing the best way to distribute 
that money. Retrospective assessments are 
easy, but subject to confounding when com- 
paring groups that were funded through 
different schemes. For example, one study 
has found that HHMI-funded investiga- 
tors publish more high-impact papers 
and get more recognition than matched 
NIH-funded peers⁶. But it’s impossible to 
match scientists perfectly: the prestige of 
the HHMI name alone may lead to more 
peer-based recognition. Prospective com- 
parisons are more reliable, but require long 
follow-up. For example, controlled trials 
could randomize consenting scientists to 
different funding schemes, then compare 
surrogate metrics and long-term successes. 

Ultimately, funding schemes should sup- 
port the long-term goals of science. Few 
isolated research efforts have an immediate, 
substantial and durable impact; successful 
translation of basic research to practical 
applications occurs sparingly and with 
average delays of almost three decades⁷. 
The aim of science is to expand our knowl- 
edge base, which, eventually, yields useful 
applications. This is what scientists entered 
their profession to do, so requiring them 
to spend most of their time writing grants 
is irrational. It’s time to seriously consider 
another approach. 


John P. A. Ioannidis is the C. F. Rehnborg 
chair in disease prevention, professor of 
medicine and of health research and policy, 
and director of the Stanford Prevention 
Research Center at Stanford University 
School of Medicine, and professor of statistics 
at Stanford University School of Humanities 
and Sciences, Stanford CA 94305. 

e-mail: jioannid@stanford.edu 


1. Editorial. Sci. Am. 304, 10 (2011). 
2. Tatsioni, A., Vavva, E. & Ioannidis, J. P. FASEB J. 24, 1335-1339 (2010). 
3. Graves, N., Barnett, A. G. & Clarke, P. Nature 469, 299 (2011). 
4. Baggerly, K. Nature 467, 401 (2010). 
5. Simera, I., Moher, D., Hoey, J., Schulz, K. F. & Altman, D. G. Eur. J. Clin. Invest. 40, 35-53 (2010). 
6. Azoulay, P., Graff Zivin, J. S. & Manso, G. Incentives and Creativity: Evidence from the Academic Life Sciences (NBER Working Paper No. 15466, 2009). 
7. Contopoulos-Ioannidis, D. G., Alexiou, G. A., Gouvias, T. C. & Ioannidis, J. P. A. Science 321, 1298-1299 (2008). 




29 SEPTEMBER 2011 | VOL 477 | NATURE | 531 


© 2011 Macmillan Publishers Limited. All rights reserved 


Family support in Brazil bolsters well-being even when economic resources are limited. 


E. MARTINO/PANOS 


Time to legislate for the good life 


Charles Seaford argues that a clear measure of well-being should be devised to 
help people judge how government policies affect their quality of life. 


In a speech in May, British prime minister David Cameron remarked that 
a programme under way at the UK Office for National Statistics (ONS) 
to collect social data and develop decision-making rules based on 
well-being “may be the most quietly radical thing this government is 
doing”. The riots in the United Kingdom last month, an expression of 
social dissatisfaction that was more than a response to economic 
conditions, highlight the importance of this approach. 
Governments in France, Italy, Germany, Spain, the United States, 
Australia, China and Ecuador are also taking steps to measure quality 
of life as well as economic growth. The European Commission and the 
Organisation for Economic Co-operation and Development are harmonizing 
these efforts. And the General Assembly of the United Nations has 
called upon member states to place greater emphasis on measures of 
well-being in their evaluations of social and economic development. My 
colleagues and I at the New Economics Foundation (NEF) in London have 
been arguing for such a focus since 2001; we are now working with the 
UK government and the European Commission to ensure that these efforts 
are translated into practical implications for public policy. 

This interest in well-being, and in its subjective measurement, is 
good news. Economic growth is just one of many tools for bringing 
about good lives. Political decisions involve trade-offs: between, 
say, fostering economic growth and stable communities, or agreeable 
urban landscapes. The traditional focus on gross domestic product 
(GDP) as a target biases these decisions. The result is lower levels 
of public well-being than could be the case if people's quality of 
life was the priority. As economic activity places a greater strain on 
the environment than many other routes to happiness (such as spending 
time with one’s family), this bias is also bad for sustainability 
(see ‘Good lives needn't cost the Earth’). 

There are two key challenges for researchers, politicians and 
policy-makers: first, to gather and interpret new data, so as to create 
a much fuller science of well-being to rival traditional economics; 
and second, to create public understanding of some headline measure of 
well-being and of the role of policy in influencing it, in order to 
create the political will to use the new science. 


SUMMARY 

• Governments worldwide should measure and monitor well-being using 
national surveys. 

• An internationally accepted well-being statistic should be developed 
and communicated to the public. 

• Public debate around this statistic will prompt policy-makers to 
maximize well-being over economic growth. 


THE EVIDENCE 
Well-being is variously defined. Psycholo- 
gists see it as ‘good functioning or the meet- 
ing of psychological needs’, an approach that 
emphasizes relationships, autonomy, com- 
petence and purpose. Economists use more 
abstract terms such as ‘happiness’ or ‘utility’. 

Social surveys over several decades have 
shown that economic and social policies 
affect aspects of well-being, however it is 
defined. Income correlates with well-being, 
but only up to a certain level, which varies 
between countries. In the United States, for 
example, earnings above US$75,000 don't add 
much more happiness. Studies also reveal 
that loss of income is more damaging than 
a gain is beneficial, and unemployment is 
more damaging to well-being than is the con- 
sequent loss of income. Casual employment 
is bad for well-being, but self-employment 
is good, at least for those earning a decent 
income. Commuting is bad. 

Moderate economic growth is good for well-being, but the disruption to 
lives that is associated with a rapid boom is bad. Inflation is 
harmful, but a 1% rise in inflation hurts less on average than a 1% 
rise in unemployment. Fluctuating rates of inflation are worse than 
stable ones. High levels of short-term debt are associated with low 
values of well-being. Effective democratic structures, social trust, 
social networks and time spent socializing are all important for 
well-being, as are access to parks and views of greenery for 
city-dwellers. 

[Figure data source: Happy Planet Index 2.0, NEF] 

Levels of well-being vary sharply between 
countries. Sub-Saharan Africa and Eastern 
European countries score badly. Scandina- 
via and Latin America do well. In general, 
there is a positive association between social 
equality and well-being, although there are 
exceptions: inequality is high in Latin Amer- 
ica, for example, but family support plays a 
strong role in social security. 

So an economic policy driven by what 
we already know about well-being would 
place more emphasis than do current eco- 
nomically centred policies on securing full 
employment, on stability, and on preserv- 
ing and strengthening communities and 
ensuring a reasonable level of income for all. 
Governments have not yet made those 
changes. What will cause them to do so? 


A SINGLE MEASURE 
A headline statistic that captures well-being 
— a measure that voters recognize and can 
use to hold politicians to account, like the 
unemployment or economic growth figures 
— could make the difference. Unless voters 
care, politicians won't care. And voters won't 
care without a number — a measure that tells 
them how things have been going and might 
be projected to the future. The GDP statistic 
has had this effect since it was invented in 
1934. 

The Stiglitz commission, set up in 2008 by French president Nicolas 
Sarkozy, has laid some of the groundwork for developing a measure of 
well-being, but there is more to do. The measure must be able to move 
and be seen to move over time, in the way that GDP figures do. It must 
be in a format that allows international comparisons, perhaps 
accounting for intercultural differences by focusing on movements 
rather than absolute values. 

[Figure: ‘Good lives needn't cost the Earth’] 

Most important, it must link well-being 
to social and economic conditions — and 
that will require further research. The survey 
results published are rarely linked to par- 
ticular policy decisions. And if they are not 
‘official’, the statistics may go unnoticed 
beyond a given field. The causal factors 
underlying correlations between happiness 
and social conditions are often poorly under- 
stood, the variables are difficult to control for 
and the results can seem inconsistent between 
studies. Large-scale studies that investigate 
many potential drivers of well-being and also 
include adequate controls are scarce. 

The United Kingdom is leading the way to 
better measurement with an annual survey 
by the ONS (on which the NEF has advised) 
that will probe the well-being and circum- 
stances of 200,000 people. Since April, the 
ONS has included in its Integrated House- 
hold Survey four subjective questions: how 
satisfied people are with their lives; how 
happy they were yesterday; how anxious 
they were yesterday; and how worthwhile 
they think the things they do are. The survey 
also asks a large number of questions about 
objective circumstances that are influenced 
by government policy — for example, health, 
housing, education, household income 
and employment patterns and benefit 
entitlements. 

With such information it will become 
possible to trace statistical relationships and 
thereby to assess the relative impact of vari- 
ous policies on well-being. Well-being will 
then become the common currency for a 
new form of cost-benefit analysis, which will 
not replace political judgement and bargain- 
ing, but should inform it. Policy-makers will 
be able to ask, for example, what have been 
the relative impacts of steps to reduce unem- 
ployment in an area and steps to preserve 
the environment? Have attempts to increase 
community cohesion and increase economic 
activity worked? Have the public-health 
measures in one city been more effective at 
increasing well-being than those in another? 


THEORY IN ACTION 

Our recommendation to the ONS, and to 
other governments, is that the main meas- 
ure publicized each year should be based on 
perceived levels of well-being rather than the 
drivers of it. That is, it should be based on the 
responses to the four subjective questions 
asked in its household survey, rather than 
on the objective questions and the resulting 
data. It should take the form of a percentage 
of the population above a certain threshold 
level of well-being — the percentage that is 
flourishing. 
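
To make the proposed headline figure concrete, the sketch below computes a percentage flourishing from answers to the four subjective questions. The 0-10 answer scale, the reverse-scoring of anxiety, the simple average and the threshold of 7 are all assumptions chosen for illustration; the recommendation itself does not fix how the answers are combined or where the threshold sits.

```python
from typing import List, Tuple

# One respondent's answers, each assumed to be on a 0-10 scale:
# (life satisfaction, happy yesterday, anxious yesterday, things done are worthwhile)
Response = Tuple[int, int, int, int]


def wellbeing_score(response: Response) -> float:
    """Combine the four answers into one score (hypothetical simple average)."""
    satisfied, happy, anxious, worthwhile = response
    # Anxiety is reverse-scored so that a higher score always means better.
    return (satisfied + happy + (10 - anxious) + worthwhile) / 4


def percent_flourishing(responses: List[Response], threshold: float = 7.0) -> float:
    """Percentage of respondents whose score is at or above the threshold."""
    if not responses:
        raise ValueError("no survey responses")
    flourishing = sum(1 for r in responses if wellbeing_score(r) >= threshold)
    return 100.0 * flourishing / len(responses)


# Made-up answers from four respondents.
sample = [(8, 7, 2, 9), (5, 4, 6, 5), (7, 8, 3, 7), (9, 9, 1, 8)]
headline = percent_flourishing(sample)
```

A threshold-based percentage moves only when people cross the line, so the choice of threshold strongly affects how sensitive the headline number is to policy changes.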

The analytical work identifying the driv- 
ers of this headline measure must also be 
presented, in a form that can be picked up 
by the media and can inform a public debate 
about which policies to pursue. Voters will 
come to see that movements in the measure 
— like those of GDP or inflation or carbon 
emissions — are the responsibility of politi- 
cians. Politicians will then have to take the 
well-being measures seriously, and act on the 
advice of the analysts. 

Such discussion must not be conducted 
behind closed ministry doors. The statisti- 
cal authorities should make the data publicly 
available at a level of granularity (including 
geographical detail) useful to a wide range of 
researchers. To probe causality, researchers 
will need funding for field experiments and 
qualitative work. And somehow, politicians 
and officials will need to be made to listen to 
the results even when they do not confirm 
existing views. 

All of this is possible, but it will not happen 
automatically. We must continue to push. 


Charles Seaford is head of the Centre 

for Well-being at the New Economics 
Foundation, 3 Jonathan Street, London 
SE11 5NH. He is a member of the UK 
National Statistician’s Advisory Forum on 
Well-being. 

e-mail: Charles.Seaford@neweconomics.org 


Inherited and environmental factors can switch genes on and off to give each of us, including monozygotic twins, a unique identity. 


EPIGENETICS 


Beyond face value 


Jonathan Weitzman relishes two accounts of how environment can influence the 


script of our genome. 


As an identical twin, I have always been 
fascinated by what determines who we are. Nature’s clones 
are never truly identical, so what explains 
the differences between my brother and 
myself? How much of our identity is inher- 
ited; how much acquired by interacting 
with the environment? The field of epige- 
netics, standing at the interface between 
our environment and our genes, is begin- 
ning to offer answers. 

Epigenetics explores how genetically 
identical entities, whether cells or whole 
organisms, display different characteristics, 
and how these are inherited. The past century 
witnessed amazing advances in our under- 
standing of genetics, but secrets remain hid- 
den within the genome. Epigenetics research 
is now blossoming, offering a potential pana- 
cea for these post-genome blues. 

Epigenetics: The Ultimate Mystery of Inheritance 
RICHARD C. FRANCIS 
W. W. Norton: 2011. 224 pp. £19.99, $25.95 

The Epigenetics Revolution: How Modern Biology is Rewriting Our Understanding of Genetics, Disease and Inheritance 
NESSA CAREY 
Icon Books: 2011. 320 pp. £17.99 

Two timely books open up this emergent field. Epigenetics by Richard 
Francis and The Epigenetics Revolution by Nessa 
Carey offer very different takes. Francis’s 
thoughtful and succinct book focuses on the 
narrative and the excitement of discovery, 
rather than on the nitty-gritty details at the 
molecular level. His personal tour includes 
anecdotes from his travels around the world 
and allusions to popular culture. 

Carey’s book is more DNA-centric, 
focusing on epigenetic mechanisms and the 
chemistry of chromatin, which defines how 
DNA is packaged around proteins in the 
nucleus. Her book combines an easy style 
with a textbook’s thoroughness. 


Conveying epigenetics to a broad 
readership presents challenges. The field 
is in flux and the concepts are abstract 
and technically demanding. Researchers 
struggle to come up with a unanimously 
accepted definition of the term epigenet- 
ics. And there is confusion in the literature 
between epigenetic phenomena and epige- 
netic mechanisms. The first describes the 
transmission of heritable characteristics, 
thereby resembling genetics; the second 
focuses on how genes are regulated, and 
is more aligned with molecular biology. 
Francis is interested in the modes of inher- 
itance and how these are affected by envi- 
ronmental events. Carey refers more to the 
mechanistic events that change how genes 
are switched on and off without altering 
the genes themselves. 

gene activity. He presents the cell as a theatrical 
production, noting that the standard 
view — with genes as directors, proteins 
as actors and other biochemicals as stage- 
hands — needs to be reassessed. As an alter- 
native to the ‘executive gene’ idea, in which 
the genome is in control, he urges readers 
to think about the ‘executive cell’. Francis 
relegates genes to members of the cast, and 
insists that they are regulated by, not regula- 
tors of, cellular states. 

A seasoned science writer, Francis is at his best when describing 
abstract concepts in a historical context. A chapter on developmental 
biology reminds us that the idea that form emerges gradually during 
development (known as epigenesis) arose in response to an earlier 
concept, which held that form is predetermined from the beginning 
(preformationism). Francis cautions us not to fall into the trap of 
using the ‘genetic blueprint’ or ‘genetic program’ metaphors, which he 
sees as contemporary preformationism or genomic determinism. He ends 
with an interesting concept, the ‘Janus gene’, which has both the 
outward causal aspect of gene function and an inward aspect that is 
responsive to external cues. 

Carey’s book is full of illustrations and 
entertaining metaphors. She describes 
the nucleosome as DNA wrapped around 
“eight ping-pong balls... like a long liq- 
uorice whip around marshmallows”, and 
sees DNA as a film script, with plenty of 
room for interpretation and retakes. Carey’s 
experience of the biotechnology industry 
shows in her concluding remarks on the 
pros and cons of our growing understand- 
ing of epigenetics for drug discovery, and 
on understanding the impact of diet and 
environment on disease. 

These two bold attempts to bring epige- 
netics to a wide audience are at the vanguard: 
more books will follow as the field matures. 
But future authors should avoid using the 
words ‘mystery’ and ‘revolution’ in their 
titles. What makes the field so exciting is that 
we have new tools to address old questions. 
Unravelling life’s mysteries? That's not revo- 
lution, it is why we become scientists in the 
first place. My twin brother and I are both 
professors studying genetics — I still won- 
der whether that choice was determined by 
genetics or the environment. 


Jonathan Weitzman is professor of genetics 
at the Université Paris Diderot and director 
of the CNRS unit for Epigenetics and Cell 
Fate, Paris, France. 

e-mail: jonathan.weitzman@univ-paris-diderot.fr 


Books in brief 


Humanity 2.0: What it Means to be Human Past, Present and Future 
Steve Fuller PALGRAVE MACMILLAN 288 pp. £19.99 (2011) 

Forget Frankenstein and Metropolis, says sociologist Steve Fuller. 
Neuroscience and technology are on a collision course that will 
catapult us to ‘transhumanity’, a future in which the state of being 
human starts to look decidedly alien. Fuller works his way through 
science, policy, history and philosophy, covering human biological 
roots and our aspirations to transcend them; the possibility of 
enhanced evolution as technologies converge; a ‘new theology’ 
inspired by figures such as Jesuit palaeontologist-philosopher Pierre 
Teilhard de Chardin; and our necessary quest for moral horizons. 


The Lost Photographs of Captain Scott 
David M. Wilson LITTLE, BROWN 192 pp. $35 (2011) 
‘Lost’ is a refrain commonly associated with Antarctic explorer 
Robert Falcon Scott: lost race, lost hopes, lost lives. Now, a stunning 
record of his last months has been found, giving a glimpse of 
fieldwork at the ‘final frontier’. In keeping with the research aims of 
the expedition, Scott shot 120 photographs between Cape Evans and 
Beardmore Glacier before his team’s final, ill-fated push. Most of the 
images (icy panoramas, portraits, snaps taken on the move) are 
published here for the first time, with text by polar historian David M. 
Wilson, great-nephew of Scott’s science chief Edward Wilson. 


Fool Me Twice: Fighting the Assault on Science in America 
Shawn Lawrence Otto RODALE 384 pp. $25.99 (2011) 
Less than 2% of US senators and representatives have a scientific 
background, but more than 40% are lawyers. Shawn Lawrence Otto 
suggests that is why US policy-making is rich in rhetoric and poor in 
reasoning. Reminding us of the crucial separation of church and state 
at the birth of the United States, Otto traces shifts in national attitudes 
to science and technology, from early wonder through atomic-era 
fear to widespread rejection. That these shifts are happening precisely 
when big science is needed to tackle global challenges should, he 
says, push researchers to re-engage with politicians. 


But Will the Planet Notice? How Smart Economics Can Save the World 
Gernot Wagner HILL AND WANG 272 pp. $27 (2011) 
The plastic-bag militants have got it wrong, says economist Gernot 
Wagner. We can save the planet by cutting out scientists, politicians 
and environmentalists in favour of economists. It’s they, he claims, 
who have revealed our mixed-up thinking, as we buy locally grown 
apples chilled for months, or clear rainforests to grow soya. Wagner 
calls for smart economics, such as capping pollution and paying 
for free resources. Billions of us must be motivated for change to be 
meaningful — and the best incentive is market forces, he says. 


On Extinction: How We Became Estranged from Nature 
Melanie Challenger GRANTA 272 pp. £20 (2011) 
Writer and poet Melanie Challenger explores extinction, both biological 
and cultural. Roving through wild landscapes (Antarctica and the 
Arctic, disused Cornish tin mines and old whaling stations in South 
Georgia), she interweaves the science on ecological loss and climate 
change with histories of failed industries, extinguished languages and 
wars. From albatrosses on Bird Island, South Georgia, to the Inuit of 
Canada’s Frobisher Bay, Challenger combines her meditations on our 
fragmenting world into a finely integrated study of loss. 
LINGUISTICS 


Tuning into translation 


Ellen Bialystok relishes a journey across the porous borders between languages. 


possibility of meaningful communi- 
cation, Is That a Fish in Your Ear? may 
reignite your anxiety. 

British translator David Bellos begins 
his journey through the fascinating history 
and convoluted present of translation with 
questions that seem simple, but swiftly enter 
complex territory. Language, he tells us, is a 
malleable thing. The boundaries we naively 
draw between languages are permeable, and 
all communication is interpretation. 

Translation encourages us to reconsider 
such issues as culture, nationalism, ethics 
and, above all, the part played by the mind 
— the precision and artistry that make trans- 
lation a creative process. Yet it is the mind, 
ironically, that is given the least attention in 
this otherwise wide-ranging description of 
translation. 

Bellos takes us through the meaning of 
the word ‘translatiom in different languages 
and times; the special case of biblical trans- 
lation and its influence on religious fol- 
lowing; translation in a modern industrial 
world where products are distributed inter- 
nationally; and technological innovations 
that increasingly assign the task of transla- 
tion to computers. 

Bellos explains why even the designa- 
tion of your mother tongue is not a simple 
judgement. The language of home can be 
supplanted, for instance, by that of educa- 
tion as the ‘most native’ of one’s languages. 
Neither is the choice of source and receiv- 
ing languages for a translation straight- 
forward, as exemplified by the French 
passages in the original 1869 Russian ver- 
sion of Tolstoy’s War and Peace that are 
sometimes translated, sometimes not. Add 
to this the widespread absorption of words 
from one language (particularly English) 
into another, together with the changes in 
language usage arising from texting and 
tweeting, and the very concept of a fixed 
language dissolves. 

In the absence of set structure and stable 
reference, how is translation possible at all? 
To answer this conundrum, Bellos ranges 
across linguistic theory, historical linguistics, 
personal experience and a deep knowledge 
of literature. Three of his examples shed light 
on the process of translation by describing 
the practical basics of the task, the politics 
that surround language choices and the sys- 
tems used to get the job done. 

First, to understand the challenge of 
translation fully, we must look to poetry, the
subtlest of literary forms. A good translation
must be a good match, but which of a poem's
many dimensions need to be preserved?
Meaning? Form? Metre? Rhyme, assonance 
or alliteration? 

Second, decisions on which languages 
get translated into which others, and which 
social style of speech is used, are inherently 
political. Only a small number of the world’s 
languages ever get translated, and an even 
smaller number make up the receiving 
languages; English wins again here. Bellos 
writes of the belief “that only some languages 
were suited to civilized thought” — a relic of 
nationalism and colonialism that continues 
to exert an insidious influence. 

Finally, there is the business of transla- 
tion. How does the United Nations, for 
instance, provide simultaneous interpreta- 
tions of proceedings for its entire assem- 
bly? The answer is an ingenious system that 
combines technology with economics, and 
overcomes the complexities of the task by 
deploying limited numbers of translators 
on strict rotas. 

The complexities of simultaneous translation
are a challenge at United Nations assemblies.

Bellos has little to say about how
languages are
learned, but makes 
several allusions to 
a critical period in 
childhood for learn- 
ing a second lan- 
guage. Although 
some theorists con- 
tinue to endorse that 
notion, it is incom- 
patible with the 
book’s dynamic view 
of language and the 
latest research. In my 
view, Bellos is incor- 
rect in his claim that 
no child develops lan- 
guage without being exposed to it. In 1977, 
a community of deaf children in Nicaragua 
with no exposure to language spontane- 
ously created Nicaraguan Sign Language, 
a perfectly systematic and rich system. A 
more detailed consideration of the role 
of the mind in language and translation 
would have been welcome. 

Concepts of language have evolved 
from twentieth-century structuralist and 
generativist theories — which explain lan- 
guage in terms of grammar — into one in 
which language seamlessly integrates into 
the cognitive, social and emotional lives of 
the humans who use it. Almost 40 years 
ago, psychologist Roger Brown coined 
the term ‘rich interpretation’, describing
the need to consider both what a child 
is doing and what he or she is saying in 
order to understand what the child is try- 
ing to convey. Although Bellos makes no 
reference to Brown, he implicitly extends 
Brown's idea, arguing that all meanings are 
embedded in dimensions outside language, 
and that a good translation captures many 
of them. He correctly identifies the central 
role of the mind in the process of interpre- 
tation that moves information between 
languages. 

So don't despair: human communication
is possible, as is the translation of meaning 
across the shifting boundaries of language, 
even though the translation and the origi- 
nal are never identical. Any translation can 
signify a rose; an excellent one will convey 
its fragrance.


Is That a Fish in Your Ear? Translation and
the Meaning of Everything
DAVID BELLOS
Faber & Faber: 2011.
384 pp. $26, £20
specializing in bilingualism at York
Q&A Björk
Digital diva 


The Icelandic singer Björk’s album Biophilia, set for release on 10 October, features songs on

DNA, crystals, viruses and electricity — each accompanied by an iPad app that doubles as a 
music lesson. She explains how her upbringing and environmental activism have inspired her,
and how touch-screen technology has changed the way she composes. 


What is the Biophilia project? 

It's a multimedia exploration of music, nature 
and technology, which is also educational. 
Each song has a linked iPad app — with ani- 
mation, music scores and lyrics — that teaches 
kids the basics of music, such as rhythms 
and scales. I felt that the years between five 
and eight, when a child’s brain is soaking up 
languages and learning to read and write, are 
the perfect time to absorb musical theory. 
And now I can become a music teacher. We 
are planning live shows and workshops for 
kids at science museums, including ones in 
San Francisco and Tokyo. 


How did you put the tracks together? 

I took algorithms of patterns in nature, put 
them into an iPad, played with them and then 
plugged the result into an instrument. The 
arpeggios on the bassline of one track, ‘Thun-
derbolt’, were created from the electrical dis-
charge of a Tesla coil. A choppy, complex
piece became ‘Crystalline’, which is about
structure. ‘Virus’ is a kind of love song, where
viruses take over; it is about arrangement.
‘Hollow’, a song about DNA, is about rhythm;
‘Dark Matter’ is about musical scales.


What inspired you to link nature and music? 
I've always written songs while walking — in 
accompaniment with nature. I'd walk up a hill, 
and that would be the verse; when I'd reach
the top, that would be the chorus. And if it was 
snowing, or sunny, that would result in very 
different songs. I would mould the melodies 
by singing them over and over. Afterwards I'd 
get the computer out. But touch screens have 
revolutionized the way I work: now I can write 
and arrange songs at the same time. 


How else have touch screens influenced you? 
I can compose more organically. Electronic
music has been criticized for being too cold
or grid-like. But with a touch screen the grid
becomes like water. In the past I didn't have
the right tools: on my album Homogenic I
took electronic beats and distorted them to
be like eruptions or lava, to be Icelandic.
By using a touch screen, Biophilia has
taken me to a different place. They are also
good for education. The wrong way to be
taught is with a book: it should be something
you can turn in circles, and play with.

NATURE.COM
For more on Björk's Biophilia:
go.nature.com/aepg49


Is Iceland a big influence? 

When I came up with the idea for Biophilia 
I was involved in an environmental battle to 
stop aluminium smelters from being built in 
Iceland. Then the bank crash happened and 
we were wondering how to create jobs. In 
Iceland — a volcanic place — a lot of peo- 
ple choose science subjects at university, 
especially geology, biology, physics and 
mathematics. But now, in a nation of 350,000,
there are hundreds of unemployed biotech 
people. We're in trouble because we're not 
seeing the obvious. We need to find solutions 
to problems like pollution and energy that are 
sustainable, and I want to help. 


Are you interested in science? 

I was a maths nerd in school. And I was keen 
to learn more for this album. I watched a lot of 
educational DVDs, including ones on string 
theory. Some scientists are now saying that 
the Big Bang didn't exist. I think it is refresh- 
ing that the twenty-first century already has 
different ‘creation myths’ from the twentieth 
century. One track on Biophilia, ‘Cosmogony’,
is about creation myths. The first verse is from 
Native American culture; the second, Sanskrit 
scriptures; the third, Australian Aboriginal 
culture; and the fourth, Big Bang theory. In a
way, they are all saying the same thing. 


Did you discuss the project with scientists? 

I spoke to Oliver Sacks and read his book 
Musicophilia, on music and the brain. It was 
an inspiration for this project — and its title. 
As English isn’t my mother tongue, I thought 
biophilia meant ‘empathy with nature’. Then
somebody told me it means ‘love of life’, which
isn’t far off. Sacks’s book opens with a man 
who got hit by lightning and started hearing 
music in his head. He quit his job, learned the 
piano and became a composer. My favour- 
ite chapter is about the connection between 
Tourette's syndrome and jazz drumming. 


What links science and music? 

String theory is about strings that vibrate, 
which is essentially music. I’ve also read 
about the history of the music of the spheres, 
from Pythagoras to our time. Every hun- 
dred years or so, somebody tries to write the 
music of the spheres. It’s always a different 
time or country, but it’s always the same idea. 
Pythagoras was tapping into harmonics and 
how sound works, but that is similar to how 
atoms and the Solar System work. String 
theory is trying to unite the Solar System and 
the atoms. Like sound, you throw something 
out at the world and it comes back at you.
Bureaucracy savings
should go to research 


We believe that there is a 
pressing need to cut the cost of 
bureaucracy in universities and 
to channel those savings into 
research. 

In Australia, for example, 
spending by the top ten 
research universities rose from 
almost Aus$6 billion in 2003 
(US$3.9 billion at the time) to 
more than Aus$10 billion in 
2010 (2010 US$9.2 billion), but 
only 30% of this was allocated to 
the employment of academics. 
This is worrying, given that a 
university's main objectives are 
research and teaching. Almost 
as much (26%) was spent on 
non-academic salaries; the rest 
went on assets such as branding, 
buildings and equipment. 
Similar spending patterns can be 
found in UK and US universities. 

As well as the restrictive effect 
on academic appointments, this 
bureaucratic burden adversely 
affects scholarly output, with 
academics spending too 
much time on paperwork and 
internal meetings. Some must 
even forgo research altogether 
as they become swamped by 
administrative tasks. 

Senior university officials 
should make radical changes 
to the spending habits of 
institutions, despite the likely 
resistance that would come 
from non-academics. Australian 
academic Alan Trounson, 
president of the California 
Institute for Regenerative 
Medicine (CIRM) in San 
Francisco, has shown that such 
changes are feasible. In 2009, 
he pledged to spend less than 
6% of CIRM’s revenues on 
administration costs, a figure 
that is better than for most 
commercial firms (see go.nature. 
com/ummgkb). As of February 
2011, CIRM had spent 4.1% on 
administration (see go.nature. 
com/ntv5he). 

Administration should enable 
universities to run efficiently, but 
it must be streamlined. A bold 
university that reforms its cost 


structures will have more money 
to spend on its core business. 
Nicholas Graves, Adrian G. 
Barnett Queensland University 
of Technology, Queensland, 
Australia. 

n.graves@qut.edu.au 


Ireland: a strong
knowledge economy 


I take issue with Mojgan 
Naghavi’s and Derek Walsh's 
critique of Science Foundation 
Ireland (SFI; Nature 476, 399; 
2011). Ireland has moved 
steadily up the country rankings 
of research performance and 

is now in the top 20, according 
to Thomson Reuters Essential 
Science Indicators. 

This success is largely a result 
of the rigour of the international 
competitive review process 
implemented by the SFI since 
2000, intended to select only the 
best science for funding. 

This strategy has been 
purposefully carried out 
by successive governments 
against a worsening domestic 
and international economic 
backdrop, exemplifying a 
consistent, considered and long- 
term approach to research in 
Ireland. 

Also, the SFI does not directly 
recruit researchers. Rather, it 
allows researchers and host 
institutions to compete for 
research funding. 

The foundation has made 
a significant investment in 
research infrastructure over the 
past decade. However, more 
than 75% of state investment 
has been, and continues to be, 
in the scientists themselves. The 
budget of the SFI in 2011, far 
from fading, was increased by 
7% at a time when most public 
spending has been significantly 
curtailed. 

Naghavi and Walsh's 
suggestion that Ireland is among 
nations with “ill-planned 
knowledge-economy ventures” 
is at odds with its impressive 
statistics on inward investment 
in research and development, 
and the role of this in enhancing
exports, increasing employment 
and adding value to Irish firms 
(see, for example, www.forfas.ie 
and www.idaireland.com). 
Graham Love Science 
Foundation Ireland, Dublin, 
Ireland. 

graham.love@sfi.ie 


Ireland: global links 
pay off 


I contend that Mojgan Naghavi’s
and Derek Walsh's claim of 
cronyism in the Irish university 
system is unfounded (Nature 
476, 399; 2011). 

Excellence in research and 
teaching are the criteria for 
recruitment. The openness 
of our recruitment system is 
evident: more than 35% of 
doctoral candidates and 35% 
of postdoctoral researchers in 
Ireland are from overseas, and 
26-42% of permanent academic 
staff across the seven Irish 
universities are international. 
They add strength to the cohort 
of tenured senior investigators, 
which is essential to the health of 
any advanced research system. 

Ireland is building up its 
research base through a long- 
term plan to attract and retain 
the best talent and to develop its 
research infrastructure. Progress 
began in the 1990s as a result of 
investment from the European 
Union’s Structural Funds and 
Framework Programmes for 
Research and Technological 
Development, which led to more 
international collaborations. 
Such investments are indicative 
of genuine strategic intent 
by Ireland’s government and 
universities. 

National policy initiatives 
have emphasized the need to 
invest in research as an integral 
part of economic development 
and, more recently, recovery 
(see, for example, go.nature. 
com/kwfpct). These have been 
implemented by consistent state 
and philanthropic investments 
in research, and by the ability 
of Irish universities to compete 


successfully for international 
research funds. As a result, the 
country now has state-of-the-art 
laboratories and research centres 
(see, for example, go.nature.com/ 
qzfaed) and, crucially, talented 
scientists to staff them. 

Conor O’Carroll Irish 
Universities Association, Dublin, 
Ireland.

conor.ocarroll@iua.ie 


More freedom for 
Turkish science 


The future of science in Turkey 
is being undermined by recent 
government actions (Nature 477, 
131; 2011). Academic autonomy 
is crucial if Turkish scientists are 
to flourish at home and abroad. 
A partisan approach will not 
advance Turkey’s society or stop 
the brain drain. 

The government should 
take advantage of the Turkish 
economy’s latest positive 
indications and increase the 
country’s modernization assets. 
At present, just 0.6% of gross 
domestic product is invested in 
research and development. 

To improve research morale, 
Turkey’s government needs 
to reverse the centralization 
of science policy-making and 
give more power to scientific 
committees. These should 
be made up of competent 
researchers, who understand 
their own needs better than 
officials. It should reform 
scientific councils to allow them 
to operate democratically and 
with transparency, and abandon 
partisan legislation. 

These measures would improve 
the credibility of science in society 
and help to educate a fatalistic and 
irrational public. Albert Einstein 
pointed out that imagination is 
more important than knowledge: 
the Turkish scientific community 
is simply seeking the prerogative 
and freedom to imagine. 

Caghan Kizil Technical 
University Dresden and 

DFG Center for Regenerative 
Therapies, Dresden, Germany. 
caghan.kizil@crt-dresden.de 
STRUCTURAL BIOLOGY


Snapshot of a signalling complex 


G-protein-coupled receptors initiate signalling pathways by forming complexes with agonist molecules and G proteins. 
The first crystal structure of such a complex is both reassuring and provocative. SEE ARTICLE P.549 & LETTER P.611 


THUE W. SCHWARTZ & THOMAS P. SAKMAR 


The most widely used signal-transduc-
tion mechanism in nature involves
G-protein-coupled receptors (GPCRs)
embedded in cell membranes. Understanding 
precisely how this process occurs remains one 
of the most important and challenging ques- 
tions in biology, not least because GPCRs act 
as sensors for a multitude of signals — from 
hormones and nutrients to pheromones and 
even light. And they’re not just of interest 
to academics: GPCRs are the largest class of 
drug targets, with literally hundreds of dis- 
crete receptors amenable to pharmacological 
manipulation. 

The binding of an agonist (a receptor acti-
vator) at the extracellular face of a GPCR
causes the exchange of one nucleotide (GDP)
for another (GTP) at a G protein bound to the
receptor on the other side of the plasma mem-
brane. The GTP-occupied α-subunit of the
G protein and/or the βγ-subunit then activates
intracellular effector molecules. In a paper 
that will become an instant classic (page 549), 
Rasmussen et al.¹ present the long-awaited
crystal structure of a basic transmembrane
signalling complex: the agonist-bound
β2-adrenergic receptor (β2AR) coupled to its
G-protein partner, Gs. And in an accompany-
ing paper on page 611 from the same group,
Chung et al.² probe the protein dynamics of
this signalling complex using a technique 
known as hydrogen-deuterium-exchange 
mass spectrometry (DXMS). 

During the past two decades, a series of 
X-ray structures has provided great insight 
into the structural biology surrounding GPCR 
signalling. Of particular note are the recently 
reported structures showing the receptor part 
of the active signalling complex bound to 
small fragments of the G protein’s α-subunit,
or to antibodies that mimic the G protein’». 
In addition, essential dynamic properties of 
the complex have been uncovered by spectro- 
scopic and biophysical studies, many in the 
transduction system associated with vision”. 
Importantly, however, Rasmussen et al.¹ are the
first to have crystallized the complete complex
of an active receptor and its G-protein partner.

Figure 1 | Activated receptor-G-protein complex.
Transmembrane G-protein-coupled receptors
(GPCRs) form complexes with agonist molecules
and G proteins to initiate a wide range of signalling
pathways in cells. Rasmussen et al.¹ report the
first crystal structure of a complete, activated
GPCR signalling complex: the agonist-bound
β2-adrenergic receptor (β2AR) in complex with its
G protein, Gs. In this schematic of the complex, α,
β and γ are the three subunits of Gs. Also pictured
are the protein T4-lysozyme (T4L) and a single-
chain antibody (SCA), which were incorporated
into the complex to facilitate its crystallization. The
detergent micelle acts in lieu of a cell membrane and
maintains the structure of the complex. The zigzag
lines on α and γ are endogenous lipid modifications
that anchor the G protein to the micelle. Broken
arrows within the α-subunit indicate structural
changes that lead to the separation of its two
domains (solid double-headed arrow), which allows
the G protein to exchange one nucleotide (GDP)
for another (GTP; nucleotides not shown). The
broken, double-headed arrow in β2AR indicates
the allosteric mechanism that connects G-protein
binding to high-affinity agonist binding. (Graphic
designed by Karina Aberg.)

There is a direct line connecting the funda-
mental, 30-year-old concept of ‘allosteric’
coupling between receptor and G protein 
— and the corresponding effect of GDP on 
agonist binding*” — to Rasmussen and col- 
leagues’ crystallographic snapshot’ of the 
GPCR-mediated signal-transduction pro- 
cess. Making the connection involved hard 
work rather than good luck — much of it old- 
fashioned, cold-room biochemistry based on 
more than 20 years’ worth of meticulously
compiled tricks, skills and tools, not to
mention considerable understanding of the
β2AR system.

A complete review of the biochemical 
methods used to prepare the material that 
eventually yielded crystals for Rasmussen 
and co-workers’ study would occupy an entire 
semester-long course in graduate school (and 
maybe it should). Selected highlights would 
include: the generation of a stable, nucleotide- 
free receptor-G-protein complex using an 
enzyme to remove GDP from the analogous 
GDP-containing complex; the development of 
antibodies from llamas that stabilize the active 
form of the G protein's α-subunit; and the use of
single-particle electron microscopy to identify 
the most stable signalling complexes. These and 
similar approaches will certainly be useful in 
future work directed at improving the overall 
resolution of the structure of the agonist-bound 
β2AR-Gs complex (especially the receptor
portion of the complex, which is currently vis-
ualized at relatively low resolution¹) and for
trapping other important forms of the complex, 
such as those involved in receptor deactivation. 

The structure of Rasmussen and colleagues’ 
complex (Fig. 1) and its mechanistic implica- 
tions are striking, and would not all have been 
predicted. Surprisingly, the β- and γ-subunits
of Gs barely make any direct contact with the
receptor, but do facilitate interaction between
the helical amino-terminal segment of the
α-subunit of Gs and an intracellular loop
(known as intracellular loop 2) of the recep-
tor. The carboxy-terminal tail of the α-subunit
engages the receptor in a cleft generated by the
large outward movement of TM6, one of the
receptor’s seven transmembrane segments.
The role of the α-subunit’s C terminus was,
in fact, clear from other work, such as the 
previously reported structures* of rhodopsin 
(a GPCR involved in vision) in complex with 
a fragment of its corresponding G protein. 
However, the orientation and binding mode 
of this end of the α-subunit are different in the
old‘ and new' structures, showing that major 
conformational changes occur between the 
first encounter of the receptor with the GDP- 
occupied G protein and the formation of 
the nucleotide-free complex depicted by 
Rasmussen and colleagues’ structure. 


In contrast to Rasmussen et al., who had to
make several structural changes to the β2AR-Gs
complex to facilitate its crystallization, Chung
et al.² could perform their DXMS analysis on
a mostly unmodified complex. Importantly,
Chung and colleagues’ study confirms most of
the changes of conformation and interactions
predicted from the X-ray structure¹. However,
their results suggest that the RLLL peptide
motif in Gs (a motif found in all G-protein
α-subunits) undergoes considerable changes
to its structure and/or stability during forma-
tion of the β2AR-Gs complex. This is at odds
with the X-ray structure, in which the motif
seems to be firmly locked in the middle of
a β-sheet. Because the motif forms part of a
protein strand (a β-strand) that directly con-
nects intracellular loop 2 of the agonist-bound
receptor with the ‘P-loop’ of Gs that binds to the
β-phosphate in GDP, Chung and colleagues’
data² provide a novel connection between
receptor binding and nucleotide exchange that
is not obvious from the X-ray structure¹.

The authors foresee that simultaneously, or
possibly in rapid succession, the N-terminal
helix of the α-subunit of Gs destabilizes the
adjacent β-strand, and that the α5 helix con-
nected to the bound C-terminal tail becomes
disordered. Both of these changes are com-
municated to the GDP-binding pocket of
Gs, where two large domains of the protein’s
α-subunit open like a clamshell. This confirms
a prediction made 18 years ago”, on the basis 
of the first X-ray structures of a G-protein 
α-subunit, that an activated receptor must
open up the cleft in the α-subunit to allow
nucleotide exchange. However, it seems that 
the ‘clamshell’ in Rasmussen and colleagues’ 
structure may have been caught in an extreme, 
open form in their crystals, as a recent spectro- 
scopic analysis’ of the rhodopsin-G-protein 
complex demonstrated that a more subtle, but 
still clear, opening occurs during nucleotide 
exchange. 

For the first time, we have a detailed three-
dimensional structure of the large (1,300 Å²)
interface between an activated GPCR and its
G-protein partner. We can now breathe a sigh
of relief. But there will be no rest for weary
workers in the GPCR field, because although
the structure of the β2AR-Gs complex is reas-
suring, it is also provocative. Structural con-
nections in the β2AR-Gs interface seem to be
responsible for the allosteric mechanism that 
links receptor coupling to nucleotide exchange 
on the G protein. However, the structural 
basis for the increase in agonist affinity that 
occurs on G-protein binding is still elusive. 
Moreover, we still have no clear picture of 
what determines G-protein selectivity — the 
last five amino acids of a G-protein α-subunit
determine its receptor binding partner", but 
Rasmussen and colleagues’ β2AR-Gs structure¹
does not tell us how this happens. 

Structures of additional receptor-G-protein 
complexes are therefore required, as are
complementary approaches of investigation,
such as systems and molecular-dynamics 
simulations, and spectroscopic and other bio- 
physical approaches”. And we have not even 
mentioned GPCR kinases and β-arrestins,
the proteins that are central to receptor 
desensitization and signalling through other 
intracellular pathways”. But, for now, it’s time 
to celebrate and enjoy a brief respite. Then it’s 
back to the cold room.


Thue W. Schwartz is in the Laboratory for 
Molecular Pharmacology, Department of 
Neuroscience and Pharmacology, and at the 
NNF Center for Basic Metabolic Research, 
Faculty of Health Sciences, University of 
Copenhagen, DK-2200 Copenhagen, Denmark. 
Thomas P. Sakmar is in the Laboratory of 
Molecular Biology and Biochemistry, 


Gravity tested 
on large scales 


Albert Einstein’s general theory of relativity has passed a rigorous test on the
scale of galaxy clusters. But the test does not rule out a popular alternative model
of gravity. SEE LETTER P.567


GARY WEGNER 


Gravity lies at the heart of understanding
the cosmos. Tests of theories of gravity
carried out in the laboratory (by British
scientist Henry Cavendish, Hungarian
physicist Roland von Eötvös, and a host of
others up to the present day) are conceptually
simple but are hampered by gravity’s intrinsic
weakness. Tests on the scale of our Solar Sys- 
tem do better than laboratory tests, but clean 
tests of gravity on cosmic scales (those larger 
than individual galaxies) are difficult to per-
form. On page 567 of this issue, Wojtak et al.¹
describe how they have successfully tested 
Albert Einstein’s theory of gravity, the general 
theory of relativity, against two well-known 
alternative theories of gravity on cosmic scales 
of the order of 1-10 megaparsecs. They did 
so by measuring the gravitational redshift of 
light from clusters of galaxies, an effect sepa- 
rate from the cosmological redshift, which is 
caused by the expansion of the Universe. 
General relativity is the basis of our under- 
standing of gravity. But twenty-first-century 
work in cosmology and particle theory 
strengthens the belief that it is an incomplete 
description. Although general relativity within 
the currently favoured ‘Λ cold dark matter’
(ΛCDM) cosmological model is undeniably
highly successful in explaining cosmic-scale 


observations (such as the cosmic microwave 
background radiation, baryonic oscillations, 
gravitational lensing and large-scale struc- 
ture), many loose ends remain, ranging from 
assumptions about the nature of the Universe's 
dark matter and dark energy to details of 
galaxy formation and structure. Consequently, 
alternative theories of gravity have been con- 
structed that attempt to eliminate the neces- 
sity for dark matter or dark energy in order to 
explain cosmic-scale observations. 

Of the three ‘classical’ tests of general relativ- 
ity — which involve measuring the bending of 
light by the Sun, the precession of the point of 
closest approach of Mercury to the Sun, and 
the gravitational redshift of light — the first 
two have been incorporated into a wider gen- 
eration of new gravitational and cosmologi- 
cal measurements. Different gravity models 
can be compared with observations, with the 
result that general relativity is usually the win- 
ner. The third test (the gravitational redshift) 
measures a photon’s frequency change as it 
moves in a gravitational potential. Accord- 
ing to Clifford Will’, it tests Einstein’s equiva- 
lence principle rather than being a true test of 
general relativity, but implies that gravity is a 
‘metric theory’. Its terrestrial verifications have
achieved high precision, but in the case of the 
Sun, measurements are dogged by pressure 
shifts and motions of the Sun’s visible surface, 
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In contrast to Rasmussen et al., who had to 
make several structural changes to the 8, AR-G, 
complex to facilitate its crystallization, Chung 
et al.” could perform their DXMS analysis on 
a mostly unmodified complex. Importantly, 
Chung and colleagues’ study confirms most of 
the changes of conformation and interactions 
predicted from the X-ray structure’. However, 
their results suggest that the RLLL peptide 
motif in G, (a motif found in all G-protein 
a-subunits) undergoes considerable changes 
to its structure and/or stability during forma- 
tion of the B,AR-G, complex. This is at odds 
with the X-ray structure, in which the motif 
seems to be firmly locked in the middle of 
a B-sheet. Because the motif forms part of a 
protein strand (a B-strand) that directly con- 
nects intracellular loop 2 of the agonist-bound 
receptor with the ‘P-loop’ of G, that binds to the 
6-phosphate in GDP, Chung and colleagues’ 
data’ provide a novel connection between 
receptor binding and nucleotide exchange that 
is not obvious from the X-ray structure’. 

The authors foresee that simultaneously, or
possibly in rapid succession, the N-terminal
helix of the α-subunit of Gs destabilizes the
adjacent β-strand, and that the α5 helix connected
to the bound C-terminal tail becomes
disordered. Both of these changes are communicated
to the GDP-binding pocket of
Gs, where two large domains of the protein’s
α-subunit open like a clamshell. This confirms
a prediction made 18 years ago, on the basis
of the first X-ray structures of a G-protein
α-subunit, that an activated receptor must
open up the cleft in the α-subunit to allow
nucleotide exchange. However, it seems that
the ‘clamshell’ in Rasmussen and colleagues’
structure may have been caught in an extreme,
open form in their crystals, as a recent spectroscopic
analysis of the rhodopsin-G-protein
complex demonstrated that a more subtle, but
still clear, opening occurs during nucleotide
exchange.

For the first time, we have a detailed three-dimensional
structure of the large (1,300 Å²)
interface between an activated GPCR and its
G-protein partner. We can now breathe a sigh
of relief. But there will be no rest for weary
workers in the GPCR field, because although
the structure of the β2AR-Gs complex is reassuring,
it is also provocative. Structural connections
in the β2AR-Gs interface seem to be
responsible for the allosteric mechanism that
links receptor coupling to nucleotide exchange
on the G protein. However, the structural
basis for the increase in agonist affinity that
occurs on G-protein binding is still elusive.
Moreover, we still have no clear picture of
what determines G-protein selectivity: the
last five amino acids of a G-protein α-subunit
determine its receptor binding partner, but
Rasmussen and colleagues’ β2AR-Gs structure
does not tell us how this happens.

Structures of additional receptor-G-protein
complexes are therefore required, as are
complementary approaches of investigation,
such as systems and molecular-dynamics
simulations, and spectroscopic and other biophysical
approaches. And we have not even
mentioned GPCR kinases and β-arrestins,
the proteins that are central to receptor
desensitization and signalling through other
intracellular pathways. But, for now, it’s time
to celebrate and enjoy a brief respite. Then it’s
back to the cold room.
Gravity lies at the heart of understanding
the cosmos. Tests of theories of gravity
carried out in the laboratory (by
British scientist Henry Cavendish, Hungarian
physicist Roland von Eötvös, and a host of others
up to the present day) are conceptually
simple but are hampered by gravity’s intrinsic
weakness. Tests on the scale of our Solar System
do better than laboratory tests, but clean
tests of gravity on cosmic scales (those larger
than individual galaxies) are difficult to perform.
On page 567 of this issue, Wojtak et al.¹
describe how they have successfully tested
Albert Einstein’s theory of gravity, the general
theory of relativity, against two well-known
alternative theories of gravity on cosmic scales
of the order of 1-10 megaparsecs. They did
so by measuring the gravitational redshift of
light from clusters of galaxies, an effect separate
from the cosmological redshift, which is
caused by the expansion of the Universe.
General relativity is the basis of our under- 
standing of gravity. But twenty-first-century 
work in cosmology and particle theory 
strengthens the belief that it is an incomplete 
description. Although general relativity within 
the currently favoured ‘Λ cold dark matter’
(ΛCDM) cosmological model is undeniably
highly successful in explaining cosmic-scale
observations (such as the cosmic microwave
background radiation, baryonic oscillations, 
gravitational lensing and large-scale struc- 
ture), many loose ends remain, ranging from 
assumptions about the nature of the Universe's 
dark matter and dark energy to details of 
galaxy formation and structure. Consequently, 
alternative theories of gravity have been con- 
structed that attempt to eliminate the neces- 
sity for dark matter or dark energy in order to 
explain cosmic-scale observations. 

Of the three ‘classical’ tests of general relativ- 
ity — which involve measuring the bending of 
light by the Sun, the precession of the point of 
closest approach of Mercury to the Sun, and 
the gravitational redshift of light — the first 
two have been incorporated into a wider gen- 
eration of new gravitational and cosmologi- 
cal measurements. Different gravity models 
can be compared with observations, with the 
result that general relativity is usually the win- 
ner. The third test (the gravitational redshift) 
measures a photon’s frequency change as it 
moves in a gravitational potential. According
to Clifford Will², it tests Einstein’s equivalence
principle rather than being a true test of
general relativity, but implies that gravity is a
‘metric theory’. Its terrestrial verifications have
achieved high precision, but in the case of the
Sun, measurements are dogged by pressure
shifts and motions of the Sun’s visible surface,
the photosphere. Tests have also been carried
out using white-dwarf stars, such as Sirius B
and 40 Eri B, in binary star systems. But none
of these tests probes cosmic distances.

Figure 1 | Measuring the gravitational redshift of an idealized galaxy cluster. a, The cluster is centred
on the brightest cluster galaxy (BCG) and sectioned into regions of increasing radius (here A-C). At
corresponding radii (R), below, the velocities of cluster members relative to the velocity of the BCG,
ΔV, lie within trumpet-like ‘caustic’ curves. b, Binning the velocities at the different radii gives velocity
distributions; N denotes the number of galaxies. Were there no gravitational redshift, the peaks of
the distributions at different radii would average zero. c, With the gravitational redshift present, the
peaks shift in velocity; the gravitational redshift is greatest at the cluster centre, so relative shifts become
increasingly negative with radius. Wojtak et al. constructed an averaged galaxy cluster based on archival
data, measured its gravitational redshift, and compared the result with the predictions of general relativity
and two other models of gravity.

Wojtak and colleagues¹ seem to be the first
to successfully obtain gravitational redshifts
for galaxy clusters. Although such gravitational
redshifts have been discussed previously,
there were insufficient galaxy data to produce
significant results. For typical galaxy clusters,
which are about 10¹⁴ times more massive than
the Sun, the gravitational redshift corresponds
to a velocity of the order of 10 kilometres
per second. This is tiny in comparison with
the spread of velocities of the constituent
galaxies in a cluster. In some clusters the
spread can reach 1,000 km s⁻¹, illustrating the
difficulty of the measurement, to which other
complications also contribute.
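The order of magnitude quoted above can be checked with a back-of-the-envelope calculation. The sketch below is not the authors' method: it assumes a simple Newtonian point-mass potential, v ≈ GM/(rc), and the radii it loops over are illustrative choices rather than values from the paper.

```python
# Rough Newtonian estimate of a cluster's gravitational redshift,
# expressed as an equivalent velocity: v ~ G*M / (r*c).
# Assumption: the cluster is treated as a point mass, which is only
# good for an order-of-magnitude check.
G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8        # speed of light, m s^-1
M_SUN = 1.989e30   # solar mass, kg
MPC = 3.086e22     # one megaparsec, m


def redshift_velocity_km_s(mass_msun: float, radius_mpc: float) -> float:
    """Point-mass estimate of the gravitational-redshift velocity in km/s."""
    potential = G * mass_msun * M_SUN / (radius_mpc * MPC)  # |Phi| in m^2 s^-2
    return potential / C / 1e3


if __name__ == "__main__":
    for radius in (0.1, 0.3, 1.0):  # assumed radii in Mpc, illustrative only
        v = redshift_velocity_km_s(1e14, radius)
        print(f"r = {radius:.1f} Mpc -> about {v:.1f} km/s")
```

For a mass of 10¹⁴ solar masses this gives roughly one to a few tens of kilometres per second, depending on the assumed radius, which is far below a velocity spread of up to 1,000 km s⁻¹.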

It is currently impracticable to determine the 
gravitational redshifts of individual clusters. 
Clusters have no ‘surface’, unlike the relatively
simple case of the Sun, for which only a single 
measurement suffices, and the gravitational 
redshift varies with the line of sight through 
a cluster, as do the motions of the galaxies. 
This means that many galaxies per cluster are 
needed to determine the density and orbital 
structures of the cluster. 

The term redshift can be confusing. The
measured redshift of an individual galaxy in a
cluster is the combination of several variables.
In addition to the gravitational redshift, there
are the cosmological redshift, the effects of the
galaxy’s motion in the cluster and the motion
of the cluster itself. So, in order to determine
the gravitational redshift, these other effects
must be disentangled. Galaxy clusters are not
always spherical, which adds to the complexity
of interpreting the geometry of the galaxy
velocities, and available data often provide only
a handful of galaxy-velocity measurements per
cluster. However, one might overcome this by
combining galaxy clusters. Using simulations
within the framework of the ΛCDM cosmological
model, it was found that the averaging
of about 2,500 clusters is required to compute
a reliable gravitational redshift.

Wojtak et al. therefore constructed an averaged
galaxy cluster. They used a subsample of
a new galaxy cluster catalogue based on the
Sloan Digital Sky Survey. They extracted 7,800
suitable galaxy clusters and 120,000 galaxy redshifts,
which, as already noted, contain components
of redshift other than the gravitational
redshift. To extract these data, they defined a
cluster member as lying within 6 megaparsecs
of the cluster centre projected on the sky and
within ±4,000 km s⁻¹ in redshift, which statistically
corresponds to a gravitationally bound
system. To determine each cluster’s centre,
which is needed both to define the cluster
members and to combine the clusters, they
used the brightest cluster galaxy (BCG), a type
of galaxy usually found near the cluster centre.
Defining a cluster member’s redshift and
velocity relative to the BCG allowed Wojtak
et al. to eliminate the contributions of the
cosmological redshift and cluster motion. This
is an approximation, however. A cluster’s BCG
can lie significantly far from the bottom of the
cluster’s gravitational potential well and be
moving, so the authors made a correction for
the error produced by its not always being at
the centre of the cluster.
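The membership cut described above can be written as a simple filter. The sketch below uses the two thresholds from the text (6 megaparsecs in projection and ±4,000 km s⁻¹ relative to the BCG). The `Galaxy` record, the function names and the conversion v = c(z − z_BCG)/(1 + z_BCG) are assumptions made for illustration, not details taken from the paper.

```python
from dataclasses import dataclass
from typing import List, Tuple

C_KM_S = 299_792.458  # speed of light, km/s


@dataclass
class Galaxy:
    projected_radius_mpc: float  # sky-projected distance from the BCG
    redshift: float              # observed redshift


def relative_velocity_km_s(z_gal: float, z_bcg: float) -> float:
    """Line-of-sight velocity relative to the BCG (assumed standard conversion)."""
    return C_KM_S * (z_gal - z_bcg) / (1.0 + z_bcg)


def select_members(
    galaxies: List[Galaxy],
    z_bcg: float,
    max_radius_mpc: float = 6.0,
    max_velocity_km_s: float = 4000.0,
) -> List[Tuple[Galaxy, float]]:
    """Keep galaxies inside both the radius cut and the velocity cut."""
    members = []
    for galaxy in galaxies:
        velocity = relative_velocity_km_s(galaxy.redshift, z_bcg)
        inside_radius = galaxy.projected_radius_mpc <= max_radius_mpc
        inside_velocity = abs(velocity) <= max_velocity_km_s
        if inside_radius and inside_velocity:
            members.append((galaxy, velocity))
    return members
```

Measuring every velocity relative to the BCG is what removes the cosmological redshift and the bulk motion of the cluster from each member's redshift.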

After removing the contribution from 
contaminating background and foreground 
galaxies that are not associated with the clus- 
ters, Wojtak et al. assumed that the clusters 
are spherical on average, combined them and 
constructed the mean cluster, from which they 
could extract a density and a gravitational- 
redshift profile (Fig. 1). Given all these, albeit 
reasonable, assumptions, it is reassuring that 
the resulting gravitational redshifts in the aver- 
aged galaxy cluster are of the expected order 
of magnitude. Some readers might worry that 
the gravitational redshifts are smaller than the 
errors of measurement for each individual 
galaxy, and that small systematic errors and 
biases in the galaxy-velocity measurements 
could affect the result. As a check, Wojtak
et al.¹ carried out ΛCDM numerical simulations
on mock data and found that such effects
did not significantly affect their results. Their 
gravitational redshifts are detected at the 99% 
confidence level. 

Wojtak and colleagues¹ went on to compare
the observed galaxy-cluster gravitational redshifts
with the predictions of general relativity
and modified gravity theories as a function
of cluster radius. An important assumption
needed for calculating the expected redshifts is
that the cluster density follows the well-known
Navarro-Frenk-White profile, which the
authors also found in their foregoing analysis
from their measurements of the variation of
the number of galaxies and the spread of their
velocities with radius in the mean cluster.
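To make the role of the density profile concrete, the sketch below evaluates the standard Navarro-Frenk-White density and its Newtonian potential in dimensionless units (radius x = r/r_s, potential in units of 4πGρ_s r_s²). It is an assumption-laden illustration: it uses the three-dimensional radius rather than the line-of-sight averaging a real analysis needs, and the function names are hypothetical.

```python
import math


def nfw_density(x: float) -> float:
    """NFW density in units of rho_s, with x = r / r_s and x > 0."""
    return 1.0 / (x * (1.0 + x) ** 2)


def nfw_potential(x: float) -> float:
    """NFW potential in units of 4*pi*G*rho_s*r_s**2; tends to -1 at the centre."""
    if x == 0.0:
        return -1.0
    return -math.log1p(x) / x


def shift_relative_to_centre(x: float) -> float:
    """Potential difference Phi(0) - Phi(x), proportional to the redshift
    of a galaxy at radius x measured relative to the cluster centre."""
    return nfw_potential(0.0) - nfw_potential(x)


if __name__ == "__main__":
    for x in (0.5, 1.0, 2.0, 5.0):
        print(f"x = {x:.1f}: relative shift = {shift_relative_to_centre(x):.3f}")
```

The shift is zero at the centre and grows more negative outwards, matching the qualitative trend in the figure caption; turning it into kilometres per second requires the fitted ρ_s and r_s and a division by c.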

The authors considered two popular
alternative gravity models. One, known as
f(R) theory, is consistent with general relativity
on small scales and accounts for dark
energy by invoking the ‘chameleon’ mechanism
on cosmological scales; the other is the
tensor-vector-scalar (TeVeS) theory, which
is a generalization of the modified Newtonian
dynamics (MOND) model and is designed
to eliminate the need for dark matter. Wojtak
et al. found that general relativity fits best with
the measured gravitational redshifts, that
f(R) theory is not ruled out, but that TeVeS/
MOND is discrepant. Such a conclusion has
been reached independently by other investigators,
for example Reyes et al., who analysed
gravitational lensing and clustering obtained
from 70,000 galaxies in the Sloan Digital Sky
Survey.

Discussions of gravity’s properties will continue,
but Wojtak and colleagues’ pioneering
work¹ gives a glimpse of the potential of new
cosmological tests for achieving higher precision
when millions of galaxy redshifts, from
which gravitational redshifts can be extracted,
become available in the future.


Gary Wegner is in the Department of Physics
and Astronomy, Dartmouth College, Hanover,
New Hampshire 03755, USA.
e-mail: gary.wegner@dartmouth.edu

1. Wojtak, R., Hansen, S. H. & Hjorth, J. Nature 477, 567-569 (2011).
2. Will, C. M. Living Rev. Relativity 9, 3 (2006).
3. Cappi, A. Astron. Astrophys. 301, 6-10 (1995).
4. Broadhurst, T. & Scannapieco, E. Astrophys. J.


IMMUNOLOGY

Recognition of a unique partner

The mammalian immune system can fight a myriad of pathogens. In part, this
involves a superfamily of cytoplasmic receptors that dictate assembly of specific
pro-inflammatory inflammasome complexes. SEE LETTERS P.592 & P.596


DENISE M. MONACK 


The innate immune system recognizes
and responds to pathogens in a
generic way. But it is not quite right to
call it nonspecific, as is often done. Although
certain aspects of innate immunity are nonspecific,
there is a growing appreciation that
multicellular organisms have evolved specific
recognition mechanisms as part of their innate
immune defences to trigger clearance of pathogens.
Two papers in this issue, by Kofoed and
Vance and by Zhao et al., provide the latest
evidence. They describe a family of intracellular
receptors called NAIPs that ensure specific
recognition of virulence-associated molecules,
that is, the molecules that are released by bacterial
pathogens into the cytoplasm of the mouse
immune cells in which they reside.

NAIPs belong to the NOD-like receptor 
(NLR) family of proteins, members of which 
have crucial roles in response to microbial 
infection. On detection of microbial compo- 
nents within the host-cell cytoplasm, NLRs 
initiate transcriptional programs that activate 
the immune response and trigger the activa- 
tion of the protein-cleaving enzyme caspase-1. 
As part of a multiprotein complex called the 
inflammasome, caspase-1 cleaves and activates 
the pro-inflammatory immune mediators 
IL-1β and IL-18, and initiates a pro-inflammatory
cell-death process called pyroptosis
(Fig. 1). Although previous genetic data
support a crucial role for inflammasomes in 
immune defence and in inflammatory diseases, 
we are just beginning to understand the molec- 
ular mechanisms by which NLRs recognize 
specific foreign molecules and then induce the 
formation of inflammasomes. 

Earlier studies showed that inflammasomes
containing an NLR protein called
NLRC4 are activated in response to two
distinct bacterial proteins. One is flagellin
(which makes up the whip-like flagella
that help bacteria to swim) and the other
is PrgJ, an evolutionarily conserved component
of bacterial type-III secretion systems,
which mediates bacterial virulence. But how
exactly the NLRC4 inflammasome responds
to these different pathogen-associated
proteins remained a mystery.

In a series of elegant experiments, Kofoed
and Vance (page 592) and Zhao and colleagues
(page 596) demonstrate that specific
NAIPs bind directly to particular bacterial
proteins. This leads to interaction between
the NAIPs and NLRC4 and so to the activation
of caspase-1. These results underscore the
role of NAIPs in dictating the specificity of the
NLRC4 inflammasome (Fig. 1).

The Vance group had previously shown, in
immune cells called macrophages, that NAIP5
is required for NLRC4-dependent activation
of caspase-1 in response to Legionella pneumophila
(a bacterial pathogen that causes
Legionnaires’ disease) and to the carboxy
terminus of flagellin. But NAIP5 was not
essential for caspase-1 activation in response
to another flagellated intracellular bacterium,
Salmonella enterica Typhimurium (which
causes diarrhoea), or to its PrgJ protein. As
well as NAIP5, mouse macrophages express
other NAIPs (NAIP1, NAIP2 and NAIP6),
and so the authors proposed that each NAIP
may have evolved to be specific for a unique
bacterial ligand.

Figure 1 | NAIPs and ligand specificity of
inflammasomes. Once engulfed by macrophages,
pathogenic bacteria release virulence-associated
molecules, such as PrgJ and flagellin, into the
cytoplasm. New work in mouse cells shows
that one of several related NAIP proteins, such
as NAIP2, NAIP5 and NAIP6, binds specifically
to such virulence-associated ligands. The
NAIP-ligand complexes then bind directly to
NLRC4, which is bound to the enzyme caspase-1
through a small adaptor molecule (red), thereby
forming NAIP-containing inflammasomes.
Caspase-1 is then activated, leading to activation
of the pro-inflammatory immune mediators IL-1β
and IL-18 and to host-cell death by pyroptosis.

The new studies provide genetic evidence
congruent with this proposal. Reducing
the expression of the Naip2 gene in isolated
mouse macrophages prevented caspase-1
activation and pyroptosis in response to
conserved rod proteins of type-III secretion
systems of bacterial pathogens: PrgJ from
S. Typhimurium and BsaK from Burkholderia
thailandensis. This manipulation, however,
did not affect pyroptosis in response to flagellin.
Together with the authors’ other findings,
the latest results indicate that NAIP2 is specifically
required for activation of the NLRC4
inflammasome by the conserved components
of the type-III secretion system, whereas 
NAIP5 and NAIP6 are specifically required 
for activation of the NLRC4 inflammasome 
by flagellin. 

How inflammasomes are activated has been 
a matter of hot debate among immunologists. 
It was recently shown that, during infec- 
tions, the intracellular receptor AIM2 binds 
to double-stranded viral or bacterial DNA in 
the host-cell cytoplasm, leading to the activation
of an AIM2-containing inflammasome.
However, direct binding of ligands to NAIPs or 
to NLRC4 during bacterial infection had not 
been demonstrated. 

Both teams now provide in vitro biochemical
evidence that NAIP5 binds to flagellin
and that NAIP2 binds to PrgJ/BsaK. Moreover,
ligand binding led to the association of
NAIP2 or NAIP5 with NLRC4, suggesting that
NAIPs act before NLRC4 in the inflammatory
pathway, dictating the specificity of inflammasome
activation (Fig. 1). Although these findings
shed light on some of the mechanisms
underlying inflammasome activation, it will
be exciting to demonstrate ligand binding to
NAIPs and the presence of NAIP-containing
inflammasomes in infected macrophages.

As shown by the in vitro studies described
above, one role of NAIPs is to recognize
specific bacterial ligands in the cytoplasm
of immune cells. But mice have multiple
related Naip genes (Naip1-7), and a task for
the future will be to determine the specific
roles of their encoded proteins during infection.
Studies in mice with altered Naip genes
may determine whether individual NAIPs
are expressed in a tissue- or cell-type-specific
manner.

Unlike mice, humans have only one functional
NAIP gene, which encodes the human
NAIP protein. Zhao and colleagues show
that human NAIP does not respond to intracellular
delivery of flagellin and BsaK/PrgJ-like
rod proteins. Instead, it responds to CprI,
the needle protein of the type-III secretion
system from the bacterium Chromobacterium
violaceum. In humans, C. violaceum infections
are rare. So further studies are required to
determine the relevance of this NAIP specificity
in humans. The genomic difference
between humans and mice in terms of Naip
copy number indicates divergent evolution
of this particular arm of the innate immune
system. Who knows, humans may have a
superfamily of NLRs that determine inflammasome
specificity in a similar way to the
NAIPs in mice.


CLIMATE SCIENCE

Ocean dynamics not required?

There are many challenges involved in understanding climate variations in the
tropics, and how the Pacific climate might change with global warming. One
study offers a surprising perspective on the mechanisms involved.

JING-JIA LUO

Tropical climatic phenomena, such as
the anomalous surface warming of
the central and eastern Pacific Ocean
known as El Niño, often have significant
environmental and socio-economic effects
worldwide. Understanding their dynamics is
vital to global climate prediction. Prevailing
theories assume that the essential mechanism
underpinning tropical climate is a dynamic
ocean-atmosphere interaction. However, this
idea is now being challenged, particularly for
climatic variations that take place over more
than just a few years. Writing in the Journal of
Climate, Clement et al. argue impressively that
it is not necessary to couple ocean dynamics to
the atmosphere in models to reproduce tropical
climate modes and their associated global
connections.

The atmosphere over the warm tropical
Pacific is unstable. For instance, increases in
sea surface temperature (SST) in the central
and eastern regions can trigger excessive deep
convection and large-scale adjustments to the
circulation in the atmosphere, slackening the
easterly trade winds. The wind perturbations
can drive ocean currents and propagate fast
equatorial subsurface waves, which deepen the
ocean’s thermocline (a layer of water that separates
the warm, upper part of the ocean from
the cold, deeper water). This intensifies the
initial increase in SST, particularly in the eastern
Pacific, where the thermocline is shallow
(about 50 metres below the surface), further
weakening the trade winds. Thus, tropical climate
signals such as increases in SST can grow
rapidly to become some of the most important
on Earth. The dynamic interactions between
SST, trade winds and the thermocline are
termed the Bjerknes feedback, and form the
foundation of our understanding of tropical-climate
phenomena on seasonal and longer
timescales.

But how essential are these interactions?
Clement et al. have investigated this issue by
comparing the results from a pair of experiments,
each of which simulated tropical climate
using 13 models. In the first experiment,
the models coupled the atmosphere thermodynamically
to a motionless slab of ocean
through heat and moisture fluxes. Because
the ocean was motionless, this excluded the
influence of ocean dynamics, thus disabling
the Bjerknes feedback. Surprisingly, these
models still reproduced the Southern Oscillation,
the large-scale oscillation in sea-level
atmospheric pressure that occurs between the
eastern and western Pacific, and that is usually
viewed as an atmospheric response to El Niño
through the Bjerknes feedback.

The authors’ models show that, in the 
absence of ocean dynamics, the state of the 
ocean's mixed layer — the uppermost layer 
that is affected by turbulence caused by wind, 
evaporative effects and so on — is driven by 
stochastic effects from the atmosphere. The 
ocean responds to these high-frequency sig- 
nals from the atmosphere by turning them into 
lower-frequency effects, such as variations in 
SST. Statistical analysis of their data allowed 
the authors to plot SST spectra, which show 
how the magnitude of variation in SST changes 
with the period of the variation. In this way, the 
authors found that the variance of SST in their 
slab-ocean models increased with the period 
of the variation (to use the jargon, the SST 
spectrum was ‘red’). 

The ocean's role in converting stochastic 
forcing into red noise signals was proposed 
some time ago by Klaus Hasselmann, and has
often been applied to interpret mid-latitude 
SST variations where the ocean is inert — that 
is, where the ocean has a thick mixed layer 
and slow subsurface-wave propagation — and 
where the atmosphere behaves randomly and 
is not sensitive to the SST. But Hasselmann’s 
model has rarely been used to interpret atmos- 
pheric variability, and Clement et al. are the 
first to use it to explain the Southern Oscilla- 
tion. The authors’ results are reasonable and 
convincing. 
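The reddening described above can be illustrated with the simplest stochastic climate model of the Hasselmann type: a mixed-layer temperature T that is damped at rate λ and forced by white noise, dT/dt = −λT + F(t). Its power spectrum is S(ω) = σ²/(ω² + λ²). The sketch below evaluates that formula as a function of period; the damping rate and forcing variance are placeholder values, not parameters from Clement and colleagues' models.

```python
import math


def sst_power(period_years: float,
              damping_per_year: float = 1.0,
              forcing_variance: float = 1.0) -> float:
    """Spectral power of SST at a given period for dT/dt = -lambda*T + noise.

    damping_per_year (lambda) and forcing_variance (sigma^2) are
    illustrative assumptions.
    """
    omega = 2.0 * math.pi / period_years  # angular frequency, rad per year
    return forcing_variance / (omega ** 2 + damping_per_year ** 2)


if __name__ == "__main__":
    for period in (0.1, 1.0, 10.0, 100.0):
        print(f"period {period:6.1f} yr -> power {sst_power(period):.4f}")
```

Power rises with period and then levels off near σ²/λ² once the period is much longer than the damping time: the ocean acts as a low-pass filter on fast atmospheric noise, which is what a ‘red’ spectrum means.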
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inflammasome by the conserved components 
of the type-III secretion system, whereas 
NAIP5 and NAIP6 are specifically required 
for activation of the NLRC4 inflammasome 
by flagellin. 

How inflammasomes are activated has been 
a matter of hot debate among immunologists. 
It was recently shown that, during infec- 
tions, the intracellular receptor AIM2 binds 
to double-stranded viral or bacterial DNA in 
the host-cell cytoplasm, leading to the activa- 
tion of an AIM2-containing inflammasome’. 
However, direct binding of ligands to NAIPs or 
to NLRC4 during bacterial infection had not 
been demonstrated. 

Both teams’ now provide in vitro biochem- 
ical evidence that NAIP5 binds to flagellin 
and that NAIP2 binds to PrgJ/BsaK. More- 
over, ligand binding led to the association of 
NAIP2 or NAIP5 with NLRC4, suggesting that 
NAIPs act before NLRC4 in the inflammatory 
pathway, dictating the specificity of inflamma- 
some activation (Fig. 1). Although these find- 
ings shed light on some of the mechanisms 


CLIMATE SCIENCE 


underlying inflammasome activation, it will 
be exciting to demonstrate ligand binding to 
NAIPs and the presence of NAIP-containing 
inflammasomes in infected macrophages. 

As shown by the in vitro studies described 
above!”, one role of NAIPs is to recognize 
specific bacterial ligands in the cytoplasm 
of immune cells. But mice have multiple 
related Naip genes (Naip1-7), and a task for 
the future will be to determine the specific 
roles of their encoded proteins during infec- 
tion. Studies in mice with altered Naip genes 
may determine whether individual NAIPs 
are expressed in a tissue- or cell-type-specific 
manner. 

Unlike mice, humans have only one func- 
tional NAIP gene, which encodes the human 
NAIP protein. Zhao and colleagues” show 
that human NAIP does not respond to intra- 
cellular delivery of flagellin and BsaK/Prg]J- 
like rod proteins. Instead, it responds to Cprl 
— the needle protein of the type-II secretion 
system from the bacterium Chromobacterium 
violaceum. In humans, C. violaceum infections 


Ocean dynamics 
not required? 


There are many challenges involved in understanding climate variations in the 
tropics, and how the Pacific climate might change with global warming. One 
study offers a surprising perspective on the mechanisms involved. 


JING-JIA LUO 


ropical climatic phenomena, such as 

the anomalous surface warming of 

the central and eastern Pacific Ocean 
known as El Nifio, often have significant 
environmental and socio-economic effects 
worldwide. Understanding their dynamics is 
vital to global climate prediction. Prevailing 
theories assume that the essential mechanism 
underpinning tropical climate is a dynamic 
ocean-atmosphere interaction’. However, this 
idea is now being challenged, particularly for 
climatic variations that take place over more 
than just a few years. Writing in the Journal of 
Climate, Clement et al.’ argue impressively that 
it is not necessary to couple ocean dynamics to 
the atmosphere in models to reproduce tropi- 
cal climate modes and their associated global 
connections. 

The atmosphere over the warm tropical 
Pacific is unstable. For instance, increases in 
sea surface temperature (SST) in the central 
and eastern regions can trigger excessive deep 
convection and large-scale adjustments to the 
circulation in the atmosphere, slackening the 


easterly trade winds. The wind perturbations 
can drive ocean currents and propagate fast 
equatorial subsurface waves, which deepen the 
ocean's thermocline (a layer of water that sepa- 
rates the warm, upper part of the ocean from 
the cold, deeper water). This intensifies the 
initial increase in SST, particularly in the east- 
ern Pacific, where the thermocline is shallow 
(about 50 metres below the surface), further 
weakening the trade winds. Thus, tropical cli- 
mate signals such as increases in SST can grow 
rapidly to become some of the most important 
on Earth. The dynamic interactions between 
SST, trade winds and the thermocline are 
termed the Bjerknes feedback’, and form the 
foundation of our understanding of tropical- 
climate phenomena on seasonal and longer 
timescales. 

But how essential are these interactions? 
Clement et al.’ have investigated this issue by 
comparing the results from a pair of experi- 
ments, each of which simulated tropical cli- 
mate using 13 models. In the first experiment, 
the models coupled the atmosphere thermo- 
dynamically to a motionless slab of ocean 
through heat and moisture fluxes. Because 
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are rare. So further studies are required to 
determine the relevance of this NAIP speci- 
ficity in humans. The genomic difference 
between humans and mice in terms of Naip 
copy number indicates divergent evolution 
of this particular arm of the innate immune 
system. Who knows, humans may have a 
superfamily of NLRs that determine inflam- 
masome specificity in a similar way to the 
NAIPs in mice. = 
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the ocean was motionless, this excluded the 
influence of ocean dynamics, thus disabling 
the Bjerknes feedback. Surprisingly, these 
models still reproduced the Southern Oscilla- 
tion — the large-scale oscillation in sea-level 
atmospheric pressure that occurs between the 
eastern and western Pacific, and that is usually 
viewed as an atmospheric response to El Niño
through the Bjerknes feedback. 

The authors’ models show that, in the 
absence of ocean dynamics, the state of the 
ocean's mixed layer — the uppermost layer 
that is affected by turbulence caused by wind, 
evaporative effects and so on — is driven by 
stochastic effects from the atmosphere. The 
ocean responds to these high-frequency sig- 
nals from the atmosphere by turning them into 
lower-frequency effects, such as variations in 
SST. Statistical analysis of their data allowed 
the authors to plot SST spectra, which show 
how the magnitude of variation in SST changes 
with the period of the variation. In this way, the 
authors found that the variance of SST in their 
slab-ocean models increased with the period 
of the variation (to use the jargon, the SST 
spectrum was ‘red’). 
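
A common idealization of this reddening, assumed here for illustration and not taken from the authors' models, is a first-order autoregressive (AR(1)) process: a damped mixed-layer temperature that integrates white-noise forcing from the atmosphere. In the sketch below, the function names, the damping of 0.1 per time step and the unit noise amplitude are hypothetical choices. The first function simulates such a slab, the second gives the textbook AR(1) power at a chosen period, and the third measures how much memory the simulated series has.

```python
import math
import random


def simulate_slab_sst(n_steps, damping=0.1, noise_std=1.0, seed=0):
    """Integrate T[t+1] = (1 - damping) * T[t] + white noise (unit time step)."""
    rng = random.Random(seed)
    memory = 1.0 - damping
    sst = [0.0]
    for _ in range(n_steps - 1):
        sst.append(memory * sst[-1] + rng.gauss(0.0, noise_std))
    return sst


def ar1_power(period, damping=0.1, noise_std=1.0):
    """Theoretical AR(1) spectral power at a period given in time steps (>= 2)."""
    memory = 1.0 - damping
    omega = 2.0 * math.pi / period
    return noise_std ** 2 / (1.0 + memory ** 2 - 2.0 * memory * math.cos(omega))


def lag1_autocorrelation(series):
    """Sample correlation between each value and the next one."""
    mean = sum(series) / len(series)
    num = sum((x - mean) * (y - mean) for x, y in zip(series, series[1:]))
    den = sum((x - mean) ** 2 for x in series)
    return num / den


if __name__ == "__main__":
    for period in (2, 10, 100):
        print(period, ar1_power(period))
    print(lag1_autocorrelation(simulate_slab_sst(20000)))
```

The power rises with period, which is what 'red' means here, whereas setting `damping=1.0` removes the slab's memory and returns the white forcing itself.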

The ocean's role in converting stochastic 
forcing into red noise signals was proposed 
some time ago by Klaus Hasselmann’‘, and has 
often been applied to interpret mid-latitude 
SST variations where the ocean is inert — that 
is, where the ocean has a thick mixed layer 
and slow subsurface-wave propagation — and 
where the atmosphere behaves randomly and 
is not sensitive to the SST. But Hasselmann’s 
model has rarely been used to interpret atmos- 
pheric variability, and Clement et al. are the 
first to use it to explain the Southern Oscilla- 
tion. The authors’ results are reasonable and 
convincing.

Figure 1 | Spectra of the Southern Oscillation. The Southern Oscillation is the large-scale oscillation
in sea-level atmospheric pressure that occurs between the eastern and western Pacific, and is usually
viewed as an atmospheric response to anomalous sea surface warming (El Niño) or cooling (La Niña)
in the central and eastern Pacific. a, Clement et al.” have used 13 models in which the atmosphere is 
thermodynamically coupled to a motionless, 50-metre-deep slab of ocean to plot spectra of the Southern 
Oscillation (shown). The spectra plot power — a dimensionless measurement of the magnitude of the 
difference in atmospheric pressure between the western and eastern Pacific — against the period of the 
oscillation. The blue line denotes the multi-model average, whereas the grey lines indicate results from 
individual models. In these models, the magnitude of the Southern Oscillation increases with the period 
of the oscillation. b, The authors also plotted spectra using another 13 climate models in which full ocean 
dynamics are dynamically and thermodynamically coupled to the same atmospheric models as those 
used in a. These spectra show that the strongest variations in the Southern Oscillation occur roughly 
every 2-8 years. The difference between the two spectra indicates the role of coupled ocean dynamics in 
generating the variability of the Southern Oscillation at different timescales. (Data courtesy of A. Clement.) 


Because the tropical atmosphere is influ- 
enced by SST, the ‘redness’ of SST variation 
is reflected in the spectra of the Southern 
Oscillation, which show how the magni- 
tude of the difference in sea-level atmos- 
pheric pressure between the western and 
eastern tropical Pacific varies with the 
period of the oscillation (Fig. 1). Thus, inter- 
actions between the tropical atmosphere 
and ocean help to generate coupled climate 
modes. In the absence of the Bjerknes feed- 
back, the authors’ models describe a robust, 
thermally coupled mode whose structure 
resembles the El Niño-Southern Oscillation
(ENSO) on both interannual timescales
(periods in the range of two to ten years) and 
decadal or multidecadal timescales. 

Do Clement and colleagues’ findings suggest 
that the Bjerknes feedback and ocean dynam- 
ics are not important for tropical climate? Not 
necessarily. In their second experiment, they 
simulated tropical climate using 13 models 
in which the atmosphere is coupled to a fully 
dynamic ocean’. The interannual ENSO signal 
predicted by these models was, on average, two 
to three times stronger than that predicted by 
the slab-ocean models. What’s more, a previous 
study’ found that 19 out of 24 computer mod- 
els lacking coupled ocean dynamics severely 
underestimate the intensity of ENSO. These 
results, taken together with Clement and col- 
leagues’ findings’, explicitly affirm the classic 
ENSO theory involving coupled ocean dynam- 
ics that underpins extensive current research 
into ENSO and global-climate forecasts on
seasonal-to-interannual timescales. However,
on decadal and longer timescales, ocean
dynamics seem to be less important than was 
thought: the magnitude of the variance in SST 
in the climate mode observed by Clement 
et al.” in their ocean-slab models is comparable 
to that observed in reality. When the authors 
coupled full ocean dynamics to the atmos- 
phere in their models, this actually weakened 
the decadal and multidecadal variability by 
constraining the variance of tropical climate 
to interannual timescales. 

The underlying mechanisms of decadal and 
multidecadal climate variation in the tropical 
Pacific are unknown. A lack of ocean observa- 
tions means that the precise roles of the ocean 
— particularly the role of subsurface signals 
originating from the extratropics, the regions 
poleward of the tropics — remain controver- 
sial. There is even debate over whether the 
decadal ENSO-like mode is a physical entity 
or an artefact arising from the nonlinearity of 
ocean-atmosphere coupling’. Similar debates 
have been sparked by attempts to detect and 
project tropical Pacific climate change under 
global warming”*: will increased anthropogenic
greenhouse-gas emissions lead to an El
Niño-like state, or the opposite (a La Niña-like
state)? Clement and colleagues’ work supports
neither of the two scenarios, but questions the 
importance of the Bjerknes feedback, the core 
element of prevailing hypotheses. 

One limitation of the authors’ findings’ is 
that many of their climate models are severely 
flawed when simulating the period and
magnitude of ENSO. The misrepresentation
of ENSO dynamics in such models may preclude
an accurate separation of dynamic and
thermodynamic coupling effects. Moreover,
most climate models erroneously predict the
existence of an intertropical convergence zone
(ITCZ; a band near the Equator where the
trade winds from the two hemispheres converge)
in the South Pacific, in addition to
the real one observed north of the Equator.
This problem is known as double ITCZ bias.
By raising the oceanic thermocline, ITCZs can
effectively prevent extratropical ocean signals
from penetrating into the equatorial Pacific’,
both in models and in the real world. Therefore,
the ocean’s role in decadal and multidecadal
ENSO-like variations might be underestimated
in models featuring double ITCZ bias. Increasing
the space and time coverage of ocean observations
will be crucial to address this possibility.
Similar experiments to those of Clement et al.,
conducted using improved climate models,
are also needed to validate and refine the current
results”. Despite these caveats, the authors’
findings may help to reshape our view not only
of today’s climate, but also of past and future
changes in tropical climate.


Jing-Jia Luo is at the Research Institute
for Global Change, JAMSTEC, 3173-25


50 Years Ago


The circumstances of our age
seem to present us with a very real
problem which may fittingly be
discussed under the title of
“Science and the Classics”. In
response to an international
challenge we are engaged in
expanding our educational
services. The urgent need is for
more scientists, more technologists,
more engineers. This means a
relative, if not an absolute, decline
in the study of the humanities. Is
this shift in the character of our
education good or bad? Is there
anything in the claim that the study
of the humanities produced a more
balanced type of man? Is there
in the scientific mode of training
any tendency against which it is
desirable to be on our guard? I
think that there is a tendency
among scientists to suppose that
the physical world alone is real,
and that the very successful
methods they have devised for
dealing with it are applicable to
every aspect of life. This, to my
way of thinking, is a dangerous
illusion.

From Nature 30 September 1961


100 Years Ago


Exhaustive tests have been
made during the last two weeks
by Mr. A. W. Sharman with
instruments invented by him for
telephoning through water without
wires. A small telephone station has
been erected in a room in an hotel
on the cliffs at Pegwell Bay, and the
other station has been fitted up on
a motor-boat cruising in various
parts of the bay ... The speech
transmitted through the water has
been very distinct, and the system
has shown good possibilities of its
being used as a means of verbal
communication between two
ships, such as a battleship and a
submarine.

From Nature 28 September 1911


CARDIOVASCULAR BIOLOGY


Heart fails without 
pump partner 


Mishandling of calcium ions by cardiac cells causes the heart to malfunction. The 
discovery of a crucial modification to a calcium pump inside the cell opens up a
potential way to correct this. SEE LETTER P.601 


SUDHA K. SHENOY & HOWARD A. ROCKMAN 


The activity of SERCA2a, a crucial
membrane protein located inside car- 
diac muscle cells (myocytes), is com- 
promised in heart failure. SERCA2a acts as a 
calcium-ion pump, being responsible for the 
uptake of calcium ions during excitation- 
contraction coupling, a process that connects 
electrical excitation of myocytes with con- 
traction of the heart muscle. On page 601 of 
this issue’, Kho and colleagues show that 
SERCA2a, which is present on the membrane
of a cellular organelle known as the
sarcoplasmic reticulum, is modified after its
translation by the attachment of another protein
known as SUMO1. This ‘SUMOylation’
stabilizes the SERCA2a pump and maintains 
normal functioning of the heart, which is 
disrupted if the attachment fails. 

Faulty flux of calcium ions (Ca²⁺) inside the
cardiac myocyte is a central cause of contractile
dysfunction and arrhythmias in pathophysiological
heart conditions. This is because
Ca²⁺ ions are essential for cardiac electrical
activity and directly activate the elements of
the myocyte that cause contraction.

Figure 1 shows how these events occur in
a normal cardiac myocyte. In response to
the wave of depolarization generated by the
arrival of an action potential at the myocyte
membrane, Ca²⁺ ions enter the cell through
voltage-gated L-type Ca²⁺ channels in the
membrane. This ‘activator’ Ca²⁺ triggers the
release of a large amount of Ca²⁺ stored in the
sarcoplasmic reticulum in a process known as
Ca²⁺-induced Ca²⁺ release, as well as generating
a small Ca²⁺-release event from the sarcoplasmic
reticulum (a Ca²⁺ ‘spark’)*”. During
excitation-contraction coupling, several thousand
Ca²⁺ sparks in each cell are synchronized by the
action potential to increase intracellular Ca²⁺,
which bathes the muscle’s contractile myofilaments
and causes the heart cell to contract. To
initiate relaxation of the heart, Ca²⁺ is removed
from the cytoplasm, primarily through the
action of SERCA2a (ref. 2), which hydrolyses
the cellular energy molecule ATP to pump Ca²⁺
ions back into the sarcoplasmic reticulum.

SERCA2a activity is stimulated by the 
hormone noradrenaline, released from sym- 
pathetic nerves, through phosphorylation of 
a protein called phospholamban” and by the 
SUMOylation mechanism reported by Kho 
et al.' (Fig. 1). Deficits in SERCA2a activity 
and expression are associated with heart failure 
in animal models and in humans, indicating 
that it could be a critical molecule in the 
pathogenesis of cardiac failure’. 

SUMO1 (small ubiquitin-like modifier 1)
is related to the well-known protein tag ubiq- 
uitin. Their three-dimensional structures are 
remarkably similar, despite there being little 
sequence conservation between them®. Both 
SUMOylation and ubiquitination involve 
dynamic and reversible covalent post-trans- 
lational modification of lysine amino-acid 
residues in their substrate proteins, leading to 
changes in stability, activity and/or subcellular 
localization of the modified proteins. In
general, SUMOylation does not mark substrate
proteins for degradation, whereas ubiquitina- 
tion does. The two tags may even counter- 
act one another by competing for the same 
lysine residues to define the fate of substrate 
proteins’. Discovered in 1996, some 16 years 
after ubiquitination, SUMOylation has quickly 
proved its importance in processes as diverse 
as protein transport, gene transcription, DNA 
replication and repair, and chromosome 
segregation”. 

Both SUMO and ubiquitin are activated 
by an enzyme called E1 to associate with a
carrier/conjugating enzyme, E2; they are 
both appended to their substrates by a ligat- 
ing enzyme, E3. The striking difference in 
the two pathways is that, whereas SUMO con- 
jugation uses a single E2, roughly 40 E2s are 
active in ubiquitination. Furthermore, only 
one isoform of ubiquitin exists in cells, but 
there are four SUMO isoforms — of which 
SUMOs 1-3 could be involved in substrate 
SUMOylation®. 

Kho et al. have discovered that SERCA2a 
in the heart is SUMOylated by SUMO1 at
two lysine residues (at positions 480 and 
585) within its ATP-binding domain’. They 
found that mutating both lysines prevents 
SUMOylation, enhances ubiquitination and 
degradation of SERCA2a and — surprisingly 
— reduces ATP binding. They conclude that 
SUMOylation of SERCA2a could be a pre- 
requisite for the protein’s stability as well as for 
its Ca²⁺-pumping activity.

The authors found that failing hearts have 
decreased amounts of both SUMO1 and 
SERCA2a. In an animal model of heart failure, 
overexpression of SERCA2a or SUMO1 led to a
reversal of heart failure; however, in an animal 
model deficient in SERCA2a, re-expression of 
SUMO1 alone did not restore heart function. 
Although several cardiac proteins (transcrip- 
tion factors, for example) and processes (such 
as gene transcription) could be affected by 
reduced levels of SUMO1, Kho and colleagues’ 
findings indicate that only the recovery of 
SERCA2a SUMOylation can reverse deterioration
of the heart. Another cardiac protein,
lamin A, also undergoes SUMOylation at sites
that are known to be mutated in patients with
familial dilated cardiomyopathy and those
with conduction-system disease, suggesting
that these disorders may also be attributable
to defective SUMOylation’.


Figure 1 | The SERCA2a calcium pump and its modification by SUMO1. In a healthy cardiac
myocyte, noradrenaline (NE) released from sympathetic nerves stimulates β-adrenergic receptors
(β-AR) to enhance the activity of L-type calcium-ion (Ca²⁺) channels on the cell membrane, leading to
Ca²⁺ entry into the cell. This in turn triggers the release of a large amount of Ca²⁺ from the sarcoplasmic
reticulum into the cytoplasm, and leads to myofilament contraction. Noradrenaline stimulation also
causes phosphorylation (P) of phospholamban (PLN), releasing its inhibitory effect on the sarcoplasmic-
reticulum Ca²⁺-ATPase pump, SERCA2a. Activated SERCA2a binds Ca²⁺ ions from the cytoplasm and
pumps them back into the sarcoplasmic reticulum, allowing the myofilaments to relax. Kho et al.’ show
that only SERCA2a modified by SUMO1 (S) can transport Ca²⁺ ions and maintain a healthy heart. In
a failing heart (not shown), SUMO1 is depleted and SERCA2a remains in the un-SUMOylated form.
It therefore cannot bind ATP and cannot effectively pump Ca²⁺ back into the sarcoplasmic-reticulum
lumen, so relaxation of cardiac muscle is impaired.
Figure labels: Healthy cardiac myocyte; Relaxation.

However, SUMO1 deficiency also causes
degradation of the transcription factor SRF 
by caspase enzymes’, which are involved in 
programmed cell death. Therefore, restoring 
SUMO1 activity by introducing the SUMO1 
transgene may block this and other cell-death 
pathways in the failing heart and lead to the 
observed recovery. Further investigation is 
needed to determine the cause of the reduced 
SUMO1 levels seen in heart failure: the protein 
does not undergo degradation and there is no 
evidence that gene transcription is inhibited. 

In contrast to the stimulation of SERCA2a 
by SUMO1, some potassium-ion chan- 
nels in the cell membrane are inhibited by 
SUMOylation’*"; they regain their ion-selec- 
tive activity when SUMOylation is blocked 
or reversed. Furthermore, there is evidence 
that SUMOylation could be critical for trans- 
port processes into the nucleus®”’. It might 
therefore also facilitate Ca²⁺ transport across
membranes, for example by imparting struc- 
tural features of an ‘on/off’ switch to the ion 
channels. 

Kho et al.' suggest that targeting SUMO1 
could be an attractive approach to modifying 
specific pathogenic molecular pathways in 
heart failure. Although the results look promising,
some caution is warranted. SUMO1 is
ubiquitous in its expression and biochemical
function, and many of the heart’s regulatory 
proteins and transcription factors are known 
to be SUMOylated””’, which could interfere 
with target specificity. Dysregulation of SUMO 
conjugation can contribute to the progression
of some human cancers®. Also, SUMO1
might bind to other proteins that modulate 
SERCA2a either directly or indirectly, leading
to unwanted changes in Ca²⁺ handling”.
Nonetheless, therapeutic targeting of SUMO1 
to treat heart failure remains an attractive 
proposition and should lead to new areas of 
investigation’.


A dent in carbon’s
gold standard 


The global uptake of carbon by land plants may be greater than previously 
thought, according to observations based on the enigmatic Keeling curve of 
rising atmospheric carbon dioxide. SEE LETTER P.579 


MATTHIAS CUNTZ 


Estimates of how much carbon is taken
up each year by the world’s land plants
are derived mainly from models of
the carbon cycle. Worldwide measurements 
of terrestrial carbon exchange have yielded 
an estimate’ of this global carbon uptake as 
123 +8 petagrams carbon per year (Pg C yr’; 
1 Pg is 10° g). This is so close to earlier esti- 
mates derived from models and biomass 
production that 120 Pg C yr“! can be taken 


as carbon’s ‘gold standard’ But Welp and col- 
leagues’ remind us, on page 579 of this issue, 
that we should not be complacent — land 
ecosystems might be taking in considerably 
more carbon than we thought. 

Our atmosphere is a perfect blender. 
Changes in its levels of trace gases — such 
as carbon dioxide — reveal variations in the 
total influx and uptake of its constituents. 
So if you measure the carbon exchange of a 
forest ecosystem, for example, you get the net 
exchange of all the carbon taken up by the trees 
Figure 1 | The SERCA2a calcium pump and its modification by SUMO1. In a healthy cardiac
myocyte, noradrenaline (NE) released from sympathetic nerves stimulates β-adrenergic receptors
(β-AR) to enhance the activity of L-type calcium-ion (Ca²⁺) channels on the cell membrane, leading to
Ca²⁺ entry into the cell. This in turn triggers the release of a large amount of Ca²⁺ from the sarcoplasmic
reticulum into the cytoplasm, and leads to myofilament contraction. Noradrenaline stimulation also
causes phosphorylation (P) of phospholamban (PLN), releasing its inhibitory effect on the
sarcoplasmic-reticulum Ca²⁺-ATPase pump, SERCA2a. Activated SERCA2a binds Ca²⁺ ions from the cytoplasm and
pumps them back into the sarcoplasmic reticulum, allowing the myofilaments to relax. Kho et al. show
that only SERCA2a modified by SUMO1 (S) can transport Ca²⁺ ions and maintain a healthy heart. In
a failing heart (not shown), SUMO1 is depleted and SERCA2a remains in the un-SUMOylated form.
It therefore cannot bind ATP and cannot effectively pump Ca²⁺ back into the sarcoplasmic-reticulum
lumen, so relaxation of cardiac muscle is impaired.


that are known to be mutated in patients with 
familial dilated cardiomyopathy and those 
with conduction-system disease, suggesting 
that these disorders may also be attributable 
to defective SUMOylation.

However, SUMO1 deficiency also causes
degradation of the transcription factor SRF
by caspase enzymes, which are involved in
programmed cell death. Therefore, restoring 
SUMO1 activity by introducing the SUMO1 
transgene may block this and other cell-death 
pathways in the failing heart and lead to the 
observed recovery. Further investigation is 
needed to determine the cause of the reduced 
SUMO1 levels seen in heart failure: the protein 
does not undergo degradation and there is no 
evidence that gene transcription is inhibited. 

for photosynthesis and all the carbon released by the trees and soils through respiration.
These gross-exchange fluxes (photosynthesis and respiration) are much larger than the net
ecosystem exchange that is actually measured. On the global scale, the net flux is only a few
per cent of the gross fluxes. Because small changes in photosynthesis and respiration can have
big consequences for the net carbon uptake of terrestrial ecosystems, the interplay between
photosynthesis and respiration must be well described in carbon-cycle models if they are to
reliably project into the future.

It is, however, almost impossible to measure individual components on scales larger than the
size of a leaf, let alone on a regional or continental scale.

This is where Welp et al. take advantage of the composition of oxygen isotopes in CO₂, the
chemical signature of which changes if one ¹⁶O oxygen atom in CO₂ is replaced by a heavier ¹⁸O
atom. Carbon dioxide dissolves in water and exchanges its oxygen with water’s oxygen to
equilibrium, so CO₂ is tagged by the water it comes into contact with. Different waters have
distinct isotopic compositions owing to evaporation processes in soils and leaves: the lighter
molecules evaporate faster, and the heavier ones fall behind. As a result, the oxygen isotopic
composition in CO₂ is very sensitive to photosynthesis and respiration: more photosynthesis
means more ¹⁸O, and hence higher oxygen-isotope ratios in the atmosphere.

Using an impressive 30-year record of the isotopic composition of atmospheric CO₂, Welp et al.
assess the mean atmospheric residence time for oxygen atoms in CO₂. Their 11 time series were
started in the 1970s by the late Charles Keeling, alongside the famous record of total
atmospheric CO₂ at Mauna Loa in Hawaii (Fig. 1). Welp et al. identified a strong correlation
between the observed interannual variability of the oxygen isotopes and the El Niño-Southern
Oscillation (ENSO). Such a correlation has previously been established for the isotopic
composition of water and, consequently, is now found in the oxygen isotopes of CO₂ as well.
From the mean residence time of the oxygen atoms in CO₂, Welp et al. arrive at a best guess of
global productivity of 150-175 Pg C yr⁻¹, some 25-45% more than the gold standard.

Figure 1 | Atmospheric CO₂ concentrations and isotope composition measured at Mauna Loa,
Hawaii. a, Keeling curve of CO₂ concentrations since the 1980s (ref. 7). Dots are single
measurements or daily averages; line indicates the long-term trend. b, Carbon-isotope
composition of the CO₂ at Mauna Loa. Here, δ¹³C is the deviation of the ¹³C/¹²C ratio from a
standard value. Because the carbon cycle is the major influence on both CO₂ concentrations and
¹³C/¹²C ratios, the curves in a and b correlate well with each other (that is, the downward
trend in b mirrors the upward trend in a and so do the seasonal variations). c, The
oxygen-isotope composition of the CO₂ is influenced not only by the carbon cycle, but also by
the water cycle, and so does not correlate simply with CO₂ concentration; δ¹⁸O is the deviation
of the ¹⁸O/¹⁶O ratio from a standard value. Welp and colleagues find that the interannual
variations in δ¹⁸O correlate with the El Niño-Southern Oscillation (arrows indicate El Niño
events). Their analysis of the oxygen-isotope data also provides a new estimate of global
carbon uptake on land. p.p.m., parts per million. (Data are publicly available on the Scripps
Institution of Oceanography website.)

This inference hinges on a set of assumptions and estimates. It depends, for example, on how
many CO₂ molecules actually enter a plant before one molecule is fixed by photosynthesis. The
authors think that plants eventually fix some 43% of all CO₂ molecules entering a leaf;
however, if this were only 34%, the isotope-based estimate would fall to about 120 Pg C yr⁻¹,
the current gold standard.

The global value of 43% is derived from carbon-cycle models and remains uncertain, because it
depends on the details of the models’ formulation. It also depends on the distribution of
different plant types. For example, some savannah grasses and maize (corn) fix carbon more
efficiently through the C₄ metabolic pathway, rather than by the usual C₃ route, thereby
enabling them to fix about 60% of the CO₂ molecules that enter the plant. Hence, the global
abundance and distribution of C₄ plants are important in estimates of global productivity,
whether these are derived from modelling, actual measurements or isotope-composition data. One
carbon-cycle model, for example, increased global productivity by more than 20% simply by
substituting a new map of C₄-plant distribution.

So it looks as though we are stuck with model-based estimates that are hard to validate
globally. But other isotopes might yet come to the rescue: the isotopic composition of the
carbon atoms in CO₂ provides a measure of the percentage of carbon that is fixed. This could
constrain estimates such as that offered by Welp et al., but it could also constrain C₄-plant
distribution and therefore help non-isotopic estimates of global production as well. And the
carbon-isotope estimate of the percentage of carbon that is fixed might be further refined with
the help of carbonyl sulphide, a new tracer of leaves’ ability to take up CO₂ (ref. 6).
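As a quick check on the numbers quoted in this article, the comparison between the isotope-based range (150-175 Pg C per year) and the gold standard (120 Pg C per year) can be reproduced with a few lines of arithmetic. The sketch below is only an illustration; the function name `percent_above` is made up for this example and is not taken from Welp et al.

```python
# Values quoted in the article, in Pg C per year.
GOLD_STANDARD = 120.0
ISOTOPE_RANGE = (150.0, 175.0)


def percent_above(estimate, reference=GOLD_STANDARD):
    """Return how far `estimate` lies above `reference`, in per cent."""
    return (estimate / reference - 1.0) * 100.0


if __name__ == "__main__":
    for value in ISOTOPE_RANGE:
        # Print each end of the range with its excess over the gold standard.
        print(f"{value:.0f} Pg C/yr is {percent_above(value):.1f}% above {GOLD_STANDARD:.0f}")
```

The two ends of the range come out at roughly a quarter and a little under a half above the reference, in line with the "25-45%" stated in the text.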

Gold does not tarnish easily.
With their approach, and
by making their long-term
records publicly available,
Welp and colleagues are preparing
the ground to combine
these pieces of information and polish up
carbon’s gold standard of the future.
Crystal structure of the β₂ adrenergic receptor-Gs protein complex


Søren G. F. Rasmussen, Brian T. DeVree, Yaozhong Zou, Andrew C. Kruse, Ka Young Chung, Tong Sun Kobilka,
Foon Sun Thian, Pil Seok Chae, Els Pardon, Diane Calinski, Jesper M. Mathiesen, Syed T. A. Shah, Joseph A. Lyons,
Martin Caffrey, Samuel H. Gellman, Jan Steyaert, Georgios Skiniotis, William I. Weis, Roger K. Sunahara & Brian K. Kobilka


G protein-coupled receptors (GPCRs) are responsible for the majority of cellular responses to hormones and
neurotransmitters as well as the senses of sight, olfaction and taste. The paradigm of GPCR signalling is the activation
of a heterotrimeric GTP binding protein (G protein) by an agonist-occupied receptor. The β₂ adrenergic receptor (β₂AR)
activation of Gs, the stimulatory G protein for adenylyl cyclase, has long been a model system for GPCR signalling. Here
we present the crystal structure of the active state ternary complex composed of agonist-occupied monomeric β₂AR and
nucleotide-free Gs heterotrimer. The principal interactions between the β₂AR and Gs involve the amino- and
carboxy-terminal α-helices of Gs, with conformational changes propagating to the nucleotide-binding pocket. The
largest conformational changes in the β₂AR include a 14 Å outward movement at the cytoplasmic end of
transmembrane segment 6 (TM6) and an α-helical extension of the cytoplasmic end of TM5. The most surprising
observation is a major displacement of the α-helical domain of Gαs relative to the Ras-like GTPase domain. This
crystal structure represents the first high-resolution view of transmembrane signalling by a GPCR.


Introduction 


The β₂ adrenergic receptor (β₂AR) has been a model system for the
large and diverse family of G protein-coupled receptors (GPCRs) for
over 40 years. It was one of the first GPCRs to be characterized by
radioligand binding, and it was the first neurotransmitter receptor to
be cloned and structurally determined by crystallography. The
β₂AR was initially identified based on its physiological and pharmacological
properties, but it was not known if receptors and G
proteins were separate entities, or parts of the same protein.
Subsequent biochemical studies led to the isolation and purification
of functional β₂AR and Gs, the stimulatory G protein that activates
adenylyl cyclase, and the reconstitution of this signalling complex in
phospholipid vesicles. The cooperative interactions of β₂AR and Gs
observed in ligand binding assays formed the foundation of the ternary
complex model of GPCR activation. In the ternary complex consisting
of agonist, receptor and G protein, the affinity of the receptor for
agonist is enhanced and the specificity of the G protein for guanine
nucleotides changes in favour of GTP over GDP. The GPCR field has
evolved markedly since these initial studies. Isolation of the genes and
cDNAs for the β₂AR and other GPCRs using protein sequencing and
expression cloning led to the expansion of the family by homology
cloning. More recently, sequencing of the human genome led to the
identification of over 800 GPCR genes. Experimental tools for identifying
protein-protein interactions and for expression and silencing
of genes have revealed a complex network of cellular signalling and
regulatory pathways including G protein-independent activation of
cytosolic kinases. Nevertheless, the β₂AR continues to be a relevant
model for most aspects of GPCR pharmacology, signalling and
regulation.


Notwithstanding the remarkable advances in this field, we still
know relatively little about the structural basis for transmembrane
signalling by GPCRs. Figure 1 shows the G protein cycle for the
β₂AR-Gs complex. Agonist binding to the β₂AR promotes interactions
with GDP-bound Gsαβγ heterotrimer, leading to the exchange
of GDP for GTP, and the functional dissociation of Gs into Gα-GTP
and Gβγ subunits. The separate Gα-GTP and Gβγ subunits can
modulate the activity of different cellular effectors (channels, kinases
or other enzymes). The intrinsic GTPase activity of Gαs leads to
hydrolysis of GTP to GDP and the reassociation of Gα-GDP and
Gβγ subunits, and the termination of signalling. The active state of
a GPCR can be defined as that conformation that couples to and
stabilizes a nucleotide-free G protein. In this agonist-β₂AR-Gs ternary
complex, Gs has a higher affinity for GTP than GDP, and the β₂AR has
an approximately 100-fold higher affinity for agonists than does β₂AR
alone. In an effort to understand the structural basis for GPCR signalling,
we crystallized the β₂AR-Gs complex.
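The G protein cycle described above can be read as a small state machine. The sketch below is only a schematic aid, not a kinetic model: the state and event names are hypothetical labels chosen for this illustration, and GDP release is folded into the receptor-binding step for brevity.

```python
from enum import Enum


class GState(Enum):
    """Coarse states of Gs in the cycle (labels are illustrative)."""
    GDP_HETEROTRIMER = "GDP-bound Gs heterotrimer"
    NUCLEOTIDE_FREE_COMPLEX = "agonist-receptor-Gs complex, nucleotide-free"
    GTP_DISSOCIATED = "G-alpha-GTP and G-beta-gamma, dissociated"
    GDP_ALPHA = "G-alpha-GDP after GTP hydrolysis"


# Allowed transitions; the event names are hypothetical labels for this sketch.
TRANSITIONS = {
    (GState.GDP_HETEROTRIMER, "agonist_receptor_binds"): GState.NUCLEOTIDE_FREE_COMPLEX,
    (GState.NUCLEOTIDE_FREE_COMPLEX, "gtp_binds"): GState.GTP_DISSOCIATED,
    (GState.GTP_DISSOCIATED, "gtp_hydrolysed"): GState.GDP_ALPHA,
    (GState.GDP_ALPHA, "subunits_reassociate"): GState.GDP_HETEROTRIMER,
}

CYCLE_EVENTS = ["agonist_receptor_binds", "gtp_binds", "gtp_hydrolysed", "subunits_reassociate"]


def step(state, event):
    """Return the next state, or raise ValueError if the event is not allowed."""
    try:
        return TRANSITIONS[(state, event)]
    except KeyError:
        raise ValueError(f"{event!r} is not allowed from {state.name}") from None


def run_cycle(start=GState.GDP_HETEROTRIMER):
    """Walk once around the cycle and return the list of states visited."""
    state = start
    visited = [state]
    for event in CYCLE_EVENTS:
        state = step(state, event)
        visited.append(state)
    return visited


if __name__ == "__main__":
    for visited_state in run_cycle():
        print(visited_state.value)
```

In this picture the crystallized complex corresponds to the nucleotide-free state, which is normally short-lived because GTP binds and drives dissociation.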


Crystallization of the β₂AR-Gs complex


The first challenge for crystallogenesis was to prepare a stable β₂AR-Gs
complex in detergent solution. The β₂AR and Gs couple efficiently in
lipid bilayers, but not in detergents used to solubilize and purify these
proteins. We found that a relatively stable β₂AR-Gs complex could be
prepared by mixing purified GDP-Gs (approximately 100 µM final
concentration) with a molar excess of purified β₂AR bound to a high
affinity agonist (BI-167107, Boehringer Ingelheim) in dodecylmaltoside
solution. Apyrase, a non-selective purine pyrophosphatase, was
added to hydrolyse GDP released from Gs on forming a complex with
the β₂AR. Removal of GDP was essential because both GDP and GTP
can disrupt the high-affinity interaction between β₂AR and Gs
(Supplementary Fig. 1a). The complex was subsequently purified by
sequential antibody affinity chromatography and size-exclusion chromatography.
The stability of the complex was enhanced by exchanging
it into a recently developed maltose neopentyl glycol detergent MNG-3
(NG310, Anatrace). The complex could be incubated at room temperature
for 24 h without any noticeable degradation; however, initial
efforts to crystallize the complex using sparse matrix screens in detergent
micelles, bicelles and lipidic cubic phase (LCP) failed.

To further assess the quality of the complex, we analysed the protein
by single particle electron microscopy. The results confirmed that the
complex was monodisperse, but revealed two potential problems for
obtaining diffraction-quality crystals. First, the detergent used to stabilize
the complex formed a large micelle, leaving little polar surface on
the extracellular side of the β₂AR-Gs complex for the formation of
crystal lattice contacts. Our initial approach to this problem, which
was to generate antibodies to the extracellular surface, was not successful.
As an alternative approach, we replaced the unstructured amino terminus
of the β₂AR with T4 lysozyme (T4L). We previously used T4L
to facilitate crystallogenesis of the inactive β₂AR by inserting T4L
between the cytoplasmic ends of TM5 and TM6 (ref. 3). Several different
amino-terminal fusion proteins were prepared and single particle electron
microscopy was used to identify a fusion with a relatively fixed
orientation of T4L in relation to the β₂AR.

The second problem revealed by single particle electron microscopy
analysis was increased variability in the positioning of the
α-helical component of the Gαs subunit. Gαs consists of two domains,
the Ras-like GTPase domain (GαsRas), which interacts with the β₂AR
and the Gβ subunit, and the α-helical domain (GαsAH). The interface
of the two Gαs subdomains forms the nucleotide-binding pocket
(Fig. 1), and electron microscopy two-dimensional (2D) averages and
three-dimensional (3D) reconstructions show that in the absence of
guanine nucleotide, GαsAH has a variable position relative to the
complex of T4L-β₂AR-GαsRas-Gβγ (Fig. 1b).

We attributed the variable position of GαsAH to the empty
nucleotide-binding pocket. However, as noted above, both GDP and
non-hydrolysable GTP analogues disrupt the β₂AR-Gs complex
(Supplementary Fig. 1). The addition of the pyrophosphate analogue
phosphonoformate (foscarnet) led to a significant increase in stabilization
of GαsAH as determined by electron microscopy analysis of the
detergent-solubilized complex. Crystallization trials were carried out
in LCP using a modified monolein (7.7 MAG, see Methods) designed
to accommodate the large hydrophilic component of the T4L-β₂AR-Gs
complex. Although we were able to obtain small crystals that
diffracted to 7 Å, we were unable to improve their quality through
the use of additives and other modifications.

Figure 1 | G protein cycle for the β₂AR-Gs complex. a, Extracellular agonist
binding to the β₂AR leads to conformational rearrangements of the
cytoplasmic ends of transmembrane segments that enable the Gs heterotrimer
(α, β, and γ) to bind the receptor. GDP is released from the α subunit upon
formation of β₂AR-Gs complex. The GTP binds to the nucleotide-free α
subunit resulting in dissociation of the α and βγ subunits from the receptor.
The subunits regulate their respective effector proteins adenylyl cyclase (AC)
and Ca²⁺ channels. The Gs heterotrimer reassembles from α and βγ subunits
following hydrolysis of GTP to GDP in the α subunit. b, The purified
nucleotide-free β₂AR-Gs protein complex maintained in detergent micelles.
The Gαs subunit consists of two domains, the Ras domain (αRas) and the
α-helical domain (αAH). Both are involved in nucleotide binding. In the
nucleotide-free state, the αAH domain has a variable position relative to the αRas
domain.

550 | NATURE | VOL 477 | 29 SEPTEMBER 2011

In an effort to generate an antibody that would further stabilize the
complex and facilitate crystallogenesis, we crosslinked β₂AR and the
Gs heterotrimer with a small, homobifunctional amine-reactive crosslinker
and used this stabilized complex to immunize llamas. Llamas
and other camelids produce antibodies devoid of light chains. The
single domain antigen binding fragments of these heavy-chain-only
antibodies, known as nanobodies, are small (15 kDa), rigid, and are
easily cloned and expressed in Escherichia coli (Methods). We
obtained a nanobody (Nb35) that binds to the complex and prevents
dissociation of the complex by GTPγS (Supplementary Fig. 1). The
T4L-β₂AR-Gs-Nb35 complex was used to obtain crystals that grew
to 250 µm (Supplementary Fig. 2) in LCP (7.7 MAG) and diffracted to
2.9 Å. A 3.2 Å data set was obtained from 20 crystals and the structure
was determined by molecular replacement (Methods).

The β₂AR-Gs complex crystallized in primitive monoclinic space
group P2₁, with a single complex in each asymmetric unit. Figure 2a
shows the crystallographic packing interactions. Complexes are arrayed
in alternating aqueous and lipidic layers with lattice contacts formed
almost exclusively between soluble components of the complex, leaving
receptor molecules suspended between G protein layers and widely
separated from one another in the plane of the membrane. Extensive
lattice contacts are formed among all the soluble proteins, probably
accounting for the strong overall diffraction and remarkably clear electron
density for the G protein. Nb35 and T4L facilitated crystal formation.
Nb35 packs at the interface of the Gβ and Gα subunits, with the
complementarity determining region (CDR) 1 interacting primarily
with Gβ and a long CDR3 loop interacting with both Gβ and Gα
subunits. The framework regions of Nb35 from one complex also interact
with Gα subunits from two adjacent complexes. T4L is linked to the
β₂AR only through amino-terminal fusion, but packs against the amino
terminus of the Gβ subunit of one complex, the carboxy terminus of the
Gγ subunit of another complex, and the Gα subunit of yet another
complex. Figure 2b shows the structure of the complete complex including
T4L and Nb35, and Fig. 2c shows the β₂AR-Gs complex alone.


Structure of the active-state β₂AR

The β₂AR-Gs structure provides the first high-resolution insight into
the mechanism of signal transduction across the plasma membrane
by a GPCR, and the structural basis for the functional properties of
the ternary complex. Figure 3a compares the structures of the
agonist-bound receptor in the β₂AR-Gs complex and the inactive
carazolol-bound β₂AR. The largest difference between the inactive
and active structures is a 14 Å outward movement of TM6 when
measured at the Cα carbon of E268. There is a smaller outward movement
and extension of the cytoplasmic end of the TM5 helix by 7
residues. A stretch of 26 amino acids in the third intracellular loop
(ICL3) is disordered. Another notable difference between inactive and
active structures is the second intracellular loop (ICL2), which forms
an extended loop in the inactive β₂AR structure and an α-helix in the
β₂AR-Gs complex. This helix is also observed in the β₂AR-Nb80
structure (Fig. 3b); however, it may not be a feature that is unique
to the active state, because it is also observed in the inactive structure
of the highly homologous avian β₁AR (ref. 17).

The quality of the electron density maps for the β₂AR is highest at
the β₂AR-GαsRas interface, and much weaker for the extracellular
half. The extracellular half of the receptor is not stabilized by any
packing interactions either laterally with adjacent receptors in the
membrane or through the extracellular surface. Instead, the extracellular
region is indirectly tethered to the well-packed soluble components
by the amino-terminal fusion to T4 lysozyme (Fig. 2a).
Given the flexible and dynamic nature of GPCRs, the absence of
stabilizing packing interactions may lead to structural heterogeneity
in the extracellular half of the receptor and, consequently, to the limited
quality of the electron density maps. However, the overall structure of
the β₂AR in the T4L-β₂AR-Gs complex is very similar to our recent
active-state structure of β₂AR stabilized by a G protein mimetic nanobody
(Nb80). In the β₂AR-Nb80 crystal, each receptor molecule has
extensive packing interactions with adjacent receptors and the quality
of the electron density maps for the agonist-bound β₂AR in this
complex is remarkably good for a 3.5 Å structure. Therefore, the
β₂AR-Nb80 structure allows us to confidently model BI-167107 here,
and provide a more reliable view of the conformational rearrangements
of amino acids around the ligand-binding pocket and between
the ligand-binding pocket and the Gs-coupling interface.
Figure 2 | Overall structure of the β₂AR-Gs
complex. a, Lattice packing of the complex shows
alternating layers of receptor and G protein within
the crystal. Abundant contacts are formed among
proteins within the aqueous layers. b, The overall
structure of the asymmetric unit contents shows
the β₂AR (green) bound to an agonist (yellow
spheres) and engaged in extensive interactions with
Gαs (orange). Gαs together with Gβ (cyan) and Gγ
(purple) constitute the heterotrimeric G protein
Gs. A Gs-binding nanobody (red) binds the G
protein between the α and β subunits. The
nanobody (Nb35) facilitates crystallization, as does
T4 lysozyme (magenta) fused to the amino
terminus of the β₂AR. c, The biological complex
omitting crystallization aids, showing its location
and orientation within a cell membrane.


The overall root mean square deviation between the β₂AR components
in the β₂AR-Gs and β₂AR-Nb80 structures is approximately
0.6 Å, and they differ most at the cytoplasmic ends of transmembrane
helices 5 and 6 where they interact with the different proteins (Fig. 3b-d).
The largest divergence is a 3 Å outward movement at the end of helix 6
in the β₂AR-Gs complex. However, the differences between these two
structures are very small at the level of the most highly conserved
amino acids (E/DRY and NPxxY), which are located at the cytoplasmic
ends of the transmembrane segments (Fig. 3c, d). These conserved
sequences have been proposed to be important for activation or for
maintaining the receptor in the inactive state. Of these residues, only
Arg 131 differs significantly between these two structures. In β₂AR-Nb80
Arg 131 interacts with Nb80, whereas in the β₂AR-Gs structure
Arg 131 packs against Tyr 391 of Gαs (Supplementary Fig. 3). The high
structural similarity is in agreement with the functional similarity of
these two proteins. The β₂AR-Nb80 complex shows the same high
affinity for the agonist isoproterenol as does the β₂AR-Gs complex,
consistent with high structural homology around the ligand binding
pocket.
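For readers unfamiliar with the quantities compared above, the following is a minimal sketch of how a root mean square deviation and a single atom-to-atom displacement are computed from paired coordinates. It assumes the two structures have already been superimposed and that atoms are listed in the same order; the function names are illustrative and the coordinates in the usage example are invented, not taken from the deposited structures.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two equally long lists of (x, y, z) points.

    Assumes the two coordinate sets are already superimposed and paired atom by atom.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


def displacement(atom_a, atom_b):
    """Straight-line distance between the same atom in two structures."""
    return math.dist(atom_a, atom_b)


if __name__ == "__main__":
    # Invented coordinates (in angstroms), for illustration only.
    structure_1 = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
    structure_2 = [(0.0, 0.0, 0.5), (1.5, 0.5, 0.0), (3.0, 0.0, -0.5)]
    print(f"RMSD: {rmsd(structure_1, structure_2):.2f}")
    print(f"Displacement of atom 1: {displacement(structure_1[0], structure_2[0]):.2f}")
```

A per-atom displacement of the kind reported for the end of helix 6 is the `displacement` of one chosen atom, whereas the overall figure is the `rmsd` over all paired atoms.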

The active state of the β₂AR is stabilized by extensive interactions
with GαsRas (Fig. 4). There are no direct interactions with Gβ or Gγ
subunits. The total buried surface of the β₂AR-GαsRas interface is
2,576 Å² (1,300 Å² for GαsRas and 1,276 Å² for the β₂AR). This interface
is formed by ICL2, TM5 and TM6 of the β₂AR, and by α5-helix,
the αN-β1 junction, the top of the β3-strand, and the α4-helix of
GαsRas (see Supplementary Table 1 for specific interactions). Some
of the β₂AR sequences involved in this interaction have been shown to
have a role in G protein coupling; however, there is no clear consensus
sequence for Gs-coupling specificity when these segments are aligned
with other GPCRs. Perhaps this is not surprising considering that the
β₂AR also couples to Gi and that many GPCRs couple to more than
one G protein isoform. Of the 21 amino acids of Gs that are within 4 Å
of the β₂AR, only five are identical between Gs and Gi, and all of these
are in the carboxy-terminal helix. The structural basis for G protein
coupling specificity must therefore involve more subtle features of the
secondary and tertiary structure. Nevertheless, a noteworthy interaction
involves Phe 139, which is located at the beginning of the ICL2
helix and sits in a hydrophobic pocket formed by Gαs His 41 at the
beginning of the β1-strand, Val 217 at the start of the β3-strand and
Phe 376, Cys 379, Arg 380 and Ile 383 in the α5-helix (Fig. 4c). The
β₂AR mutant F139A has severely impaired coupling to Gs. The
residue corresponding to Phe 139 is a Phe or Leu on almost all Gs
coupled receptors, but is more variable in GPCRs known to couple to
other G proteins. Of interest, the ICL2 helix is stabilized by an interaction
between Asp 130 of the conserved DRY sequence and Tyr 141
in the middle of the ICL2 helix (Fig. 4c). Tyr 141 has been shown to be
a substrate for the insulin receptor tyrosine kinase; however, the
functional significance of this phosphorylation is currently unknown.

Figure 3 | Comparison of active and inactive
β₂AR structures. a, Side and cytoplasmic views of
the β₂AR-Gs structure (green) compared to the
inactive carazolol-bound β₂AR structure (blue).
Significant structural changes are seen for the
intracellular domains of TM5 and TM6. TM5 is
extended by two helical turns whereas TM6 is
moved outward by 14 Å as measured at the
α-carbons of Glu 268 (yellow arrow) in the two
structures. b, β₂AR-Gs compared with the
nanobody-stabilized active state β₂AR-Nb80
structure (orange). c, The positions of residues in
the E/DRY and NPxxY motifs and other key
residues of the β₂AR-Gs and β₂AR-Nb80
structures. All residues occupy very similar
positions except Arg 131 which in the β₂AR-Nb80
structure interacts with the nanobody. d, View
from the cytoplasmic side of residues shown in c.

The lack of direct interactions between the β₂AR and Gβγ is somewhat
unexpected given that a heterotrimer is required for efficient
coupling to a GPCR. Whereas Gβ does not interact directly with the
β₂AR, it has an indirect but important role in coupling by stabilizing
the amino-terminal helix of Gαs (Fig. 2c). Several models involving
GPCR dimers propose that one of the protomers interacts predominantly
with Gα while the other interacts with Gβγ. Consistent
with these models, biochemical and biophysical evidence suggests
that Gαi2 forms a stable complex with a LTB4 receptor dimer.
Whereas the β₂AR efficiently activates Gs as a monomer, extensive
biochemical and biophysical evidence supports the existence of β₂AR
dimers or oligomers in living cells. Therefore, we cannot exclude the
possibility that in cell membranes one protomer of a β₂AR dimer may
interact with the Gβγ subunit.

Figure 4 | Receptor-G protein interactions. a, b, The α5-helix of Gαs docks
into a cavity formed on the intracellular side of the receptor by the opening of
transmembrane helices 5 and 6. a, Within the transmembrane core, the
interactions are primarily non-polar. An exception involves packing of Tyr 391
of the α5-helix against Arg 131 of the conserved DRY sequence in TM3 (see also
Supplementary Fig. 3). Arg 131 also packs against Tyr 326 of the conserved
NPxxY sequence in TM7. b, As α5-helix exits the receptor it forms a network of
polar interactions with TM5 and TM3. c, Receptor residues Thr 68 and Asp 130
interact with the ICL2 helix of the β₂AR via Tyr 141, positioning the helix so
that Phe 139 of the receptor docks into a hydrophobic pocket on the G protein
surface, thereby structurally linking receptor-G protein interactions with the
highly conserved DRY motif of the β₂AR.


Structure of activated Gs 


The most surprising observation in the β2AR–Gs complex is the large displacement of the GαsAH relative to GαsRas (an approximately 127° rotation about the junction between the domains) (Fig. 5a). GαsAH moves as a rigid body as shown by the alignment of β2AR–Gs with GαsAH from the crystal structure of Gαs–GTPγS (Supplementary Fig. 4). In the structure of Gαs–GTPγS, the nucleotide-binding pocket is formed by the interface between GαsRas and GαsAH. Guanine nucleotide binding stabilizes the interaction between these two domains. The loss of this stabilizing effect of guanine nucleotide binding is consistent with the high flexibility observed for GαsAH in single particle electron microscopy analysis of the detergent-solubilized complex. It is also in agreement with the increase in deuterium exchange at the interface between these two domains upon formation of the complex. Recently double electron-electron resonance (DEER) spectroscopy was used to document large (up to 20 Å) changes in distance between nitroxide probes positioned on the Ras and α-helical domains of Gi upon formation of a complex with light-activated rhodopsin. Finally, it has been shown that GαsRas and GαsAH can form a functional GTPase when expressed together as separate proteins. Therefore, it is perhaps not surprising that GαsAH is displaced relative to GαsRas; however, its location in this crystal structure most probably reflects only one of an ensemble of conformations that it can adopt under physiological conditions, but has been stabilized by crystal packing interactions (Supplementary Fig. 5).
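
A rigid-body rotation such as the approximately 127° displacement quoted above is commonly read off the rotation matrix obtained when a domain is superposed between two structures. The sketch below is illustrative only and is not the authors' procedure: it assumes a proper 3×3 rotation matrix is already available, and the function names `rotation_angle_deg` and `rotation_about_z` are hypothetical.

```python
import math

def rotation_angle_deg(R):
    """Angle (degrees) of a proper 3x3 rotation matrix, from its trace."""
    trace = R[0][0] + R[1][1] + R[2][2]
    # Clamp to [-1, 1] to guard against rounding error before acos.
    cos_theta = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_theta))

def rotation_about_z(theta_deg):
    """Hypothetical test matrix: rotation by theta_deg about the z axis."""
    t = math.radians(theta_deg)
    return [[math.cos(t), -math.sin(t), 0.0],
            [math.sin(t),  math.cos(t), 0.0],
            [0.0, 0.0, 1.0]]

# Example: recover the angle of a synthetic 127-degree rotation.
angle = rotation_angle_deg(rotation_about_z(127.0))
```

The trace formula returns the magnitude of the rotation only; the axis (here, the junction between the two domains) would have to be extracted separately.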

A potential concern is that Nb35, which was used to facilitate crystallogenesis, may be responsible for the displacement of the GαsAH. However, Nb35, which binds at the interface between Gβ and GαsRas (Fig. 2b and Supplementary Fig. 6), would not be expected to interact with the GαsAH or interfere with its interactions with GαsRas. None of the Nb35 contacts on the Ras domain are involved in interactions with GαsAH on the basis of the crystal structure of Gαs–GTPγS (1AZT). Moreover, if we superimpose the structures of the Ras domains of Gαs–GTPγS (1AZT) and β2AR–Gs, there is no overlap between Nb35 and the α-helical domain of Gαs–GTPγS (Supplementary Fig. 6). Similarly, if we align the Gβ subunits of the Gi–GDP heterotrimer (1GP2) with that of β2AR–Gs, there is no overlap between Nb35 and the α-helical domain of Gi (Supplementary Fig. 6). This analysis is in agreement with single particle electron microscopy studies which provide further evidence that Nb35 does not disrupt interactions between GαsAH and GαsRas.

The conformational links between the β2AR and the nucleotide-binding pocket primarily involve the amino- and carboxy-terminal helices of Gαs (Fig. 4). Figure 5b focuses on the region of GαsRas that undergoes the largest conformational change when comparing the structure of GαsRas from the β2AR–Gs complex with that from the Gαs–GTPγS complex. The largest difference is observed for the α5-helix, which is displaced 6 Å towards the receptor and rotated as the carboxy-terminal end projects into the transmembrane core of the β2AR. Previous studies using a variety of approaches have demonstrated the important role of the α5-helix in GPCR–G protein interactions. Associated with movement of the α5-helix, the β6–α5 loop, which interacts with the guanine ring in the Gαs–GTPγS structure, is displaced outward, away from the nucleotide-binding pocket (Fig. 5b–d). The movement of the α5-helix is also associated with changes in interactions between this helix and the β6-strand, the αN–β1 loop, and the α1-helix. The β1-strand forms another link between the β2AR and the nucleotide-binding pocket. The carboxy-terminal end of this strand changes conformation around Gly 47, and there are further changes in the β1–α1 loop (P-loop) that coordinates the β-phosphate in the GDP- and GTP-bound forms (Fig. 5b–d). The observations in the crystal structure are in agreement with deuterium exchange experiments where there is enhanced deuterium exchange in the β1-strand and the amino-terminal end of the α5-helix upon formation of the nucleotide-free β2AR–Gs complex. The deuterium exchange-mass spectrometry (DXMS) studies provide additional insights into the dynamic nature of these conformational changes in Gs upon complex formation.

The structure of a GDP-bound Gs heterotrimer has not been determined, so it is not possible to directly compare the Gαs–Gβγ interface before and after formation of the β2AR–Gs complex. On the basis of the structure of the GDP-bound Gi heterotrimer, we do not observe large changes in interactions between GαsRas and Gβγ upon formation of the complex with β2AR. This is also consistent with deuterium exchange studies. As discussed above, Nb35 binds at the interface between GαsRas and Gβ (Fig. 2b); therefore, we cannot exclude the possibility that Nb35 may influence the relative orientation of the GαsRas–Gβγ interface in the crystal structure.

Figure 5 | Conformational changes in Gαs. a, A comparison of Gαs in the β2AR–Gs complex (orange) with the GTPγS-bound Gαs (grey) (PDB ID: 1AZT). GTPγS is shown as spheres. The helical domain of Gαs (GαsAH) shows a marked displacement relative to its position in the GTPγS-bound state. b, The α5-helix of Gαs is rotated and displaced towards the β2AR, perturbing the β6–α5 loop, which otherwise forms part of the GTPγS-binding pocket. c, The β1–α1 loop (P-loop) and β6–α5 loop of Gαs interact with the phosphates and purine ring, respectively, of GTPγS in the Gαs–GTPγS structure. d, The β1–α1 and β6–α5 loops are rearranged in the nucleotide-free β2AR–Gs structure.


Assembly of the β2AR–Gs complex


Clues to the initial stages of complex formation may come from the recent active state structures of rhodopsin. Figure 6a, b compares the active-state structure of β2AR in the β2AR–Gs complex with the recent structure of metarhodopsin II bound to a peptide representing the carboxy terminus of transducin. The conformational changes in TM5 and TM6 are smaller in metarhodopsin II, and the position of the carboxy-terminal helix of transducin is tilted by approximately 30° relative to the position of the homologous region of Gs. These may represent fundamental differences in the receptor–G protein interactions between these two proteins, but given the conservation of the G-protein binding pocket, the changes more probably reflect the more extensive contacts formed with the intact G protein. The position of the transducin peptide in metarhodopsin II may represent the initial interaction between a GDP-bound G protein and a GPCR. We have attempted to reproduce a similar complex between the β2AR and a synthetic peptide representing the carboxy-terminal 20 amino acids of Gs, but did not observe any effect of this peptide on receptor function, possibly due to the solubility and behaviour of the peptide in solution. However, when the carboxy-terminal 20 amino acids of Gs are fused to the carboxy terminus of the β2AR (Fig. 6c), we observe a 27-fold increase in agonist affinity (Fig. 6d). This effect is only 3.5-fold smaller than the effect we observe on agonist binding affinity in the β2AR–Gs complex, and demonstrates that there is a functional interaction between the peptide and receptor that may represent an initial stage in β2AR–Gs complex formation. Figure 6e and f presents a possible sequence of interactions of β2AR and Gs when forming the nucleotide-free complex. The first interaction of the β2AR with the Gs heterotrimer would require a movement of the carboxy terminus of the α5-helix away from the β6-strand to permit interactions with the β2AR similar to those observed in metarhodopsin II (Fig. 6e). The availability of the carboxy terminus of the α5-helix for interactions with the β2AR is supported by deuterium exchange studies showing that this segment is more dynamic in the Gs–GDP heterotrimer than would be expected from the crystal structure of Gαs. The subsequent formation of more extensive interactions between the β2AR ICL2 and the amino terminus of Gαs requires a rotation of GαsRas relative to the receptor and would be associated with further conformational changes in both β2AR and GαsRas (Fig. 6f). We cannot say when GDP is released during the formation of the complex; however, we speculate that uncoupling of the GαsAH from GαsRas is a consequence of nucleotide release or at least a coincident event. This binding model is in agreement with deuterium exchange experiments.
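
The two fold-changes quoted above can be combined to see what they imply for the full complex. The short calculation below is a sketch, assuming that "3.5-fold smaller" is meant as a simple multiplicative ratio between the two affinity shifts; the variable names are illustrative and the derived values are not stated in the text.

```python
import math

fusion_fold = 27.0       # agonist-affinity increase for the peptide fusion (from the text)
ratio_to_complex = 3.5   # the fusion effect is this many fold smaller than the complex (from the text)

# Implied affinity shift for the full complex, under the multiplicative reading.
complex_fold = fusion_fold * ratio_to_complex

# The same comparison on a log10 scale, which is how binding curves are usually plotted.
log_shift_fusion = math.log10(fusion_fold)
log_shift_complex = math.log10(complex_fold)
fraction_of_log_shift = log_shift_fusion / log_shift_complex
```

Under this reading the complex would shift agonist affinity by roughly two orders of magnitude, and the fused peptide alone would account for a little under three quarters of that shift on a logarithmic scale.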

The β2AR–Gs complex crystal structure provides the first high-resolution view of transmembrane signalling for a GPCR. We now have a framework to design experiments to investigate the mechanism of complex formation, GTP binding and complex dissociation. Of particular interest will be studies designed to determine the functional significance of the large movement of GαsAH relative to GαsRas that is observed in the β2AR–Gs complex. A better understanding of the structural basis for G protein activation may provide new approaches for drug discovery. The high degree of structural homology within the ligand-binding pocket has posed challenges for developing highly selective drugs for specific GPCR targets. In contrast, there is relatively low homology at the interface between the β2AR and Gαs, so identifying sequence and structural features that define specificity for particular G proteins may enable the development of selective inhibitors of specific GPCR–G protein interactions.


METHODS SUMMARY 


The β2AR–Gs complex was crystallized from β2AR and Gs protein expressed in insect cells. Crystallogenesis was aided by fusing T4 lysozyme to the amino terminus of the β2AR and the addition of a nanobody (Nb35) that binds at the interface between the Gα and Gβ subunits. Crystals were grown in lipidic cubic phase (LCP) using 7.7 MAG, a lipid that accommodates membrane proteins with larger hydrophilic surfaces. Diffraction data were measured at beamline 23ID-B of the Advanced Photon Source and the structure was solved by molecular replacement. For more experimental details see Methods.


Figure 6 | Possible sequence of β2AR–Gs complex formation. a, b, Comparison of the β2AR–Gs structure (green and orange) with metarhodopsin II (PDB ID: 3PQR) (purple) bound with the carboxy-terminal peptide of transducin (Gt) (blue). TM7 has been omitted in panel a to better visualize the G proteins. c, Cartoon of the β2AR–Gαs peptide fusion construct used in the binding experiments (d). d, Competition binding experiments between [³H]dihydroalprenolol ([³H]DHA) and full agonist isoproterenol. Top panel shows binding data (reproduced from ref. 12) on β2AR reconstituted in HDL particles with and without Gs heterotrimer. The fraction of β2AR in the Ki high state for the β2AR with Gs is 0.55. Bottom panel shows binding to β2AR and a β2AR–Gαs peptide fusion expressed in Sf9 cell membranes. The fraction of β2AR in the Ki high state for the β2AR–Gαs peptide fusion is 0.68. e, The initial interaction of agonist-bound β2AR and GαsRas may involve an orientation of the carboxy terminus of GαsRas similar to that of the carboxy-terminal peptide of transducin in the structure of metarhodopsin II. f, The final position of GαsRas on the β2AR as observed in the β2AR–Gs complex.


©2011 Macmillan Publishers Limited. All rights reserved 


Full Methods and any associated references are available in the online version of the paper at www.nature.com/nature.


METHODS


Expression and purification of β2AR, Gs heterotrimer and nanobody-35. An amino-terminally fused T4 lysozyme–β2AR construct with β2AR truncated in position 365 (T4L-β2AR, described in detail below) was expressed in Sf9 insect cell cultures infected with recombinant baculovirus (BestBac, Expression Systems), and solubilized in n-dodecyl-β-D-maltoside (DDM) according to methods described previously (see Supplementary Fig. 7 for purification overview). A β2AR construct truncated after residue 365 (β2AR-365) was used for the majority of the analytical experiments. M1 Flag affinity chromatography (Sigma) served as the initial purification step followed by alprenolol-Sepharose chromatography for selection of functional receptor. A subsequent M1 Flag affinity chromatography step was used to exchange receptor-bound alprenolol for high-affinity agonist BI-167107. The agonist-bound receptor was eluted, dialysed against buffer (20 mM HEPES, pH 7.5, 100 mM NaCl, 0.1% DDM and 10 µM BI-167107), treated with lambda phosphatase (New England Biolabs), and concentrated to approximately 50 mg ml⁻¹ with a 50 kDa molecular weight cut off (MWCO) Millipore concentrator. Prior to spin concentration, the β2AR-365 construct, but not T4L-β2AR, was treated with PNGaseF (New England Biolabs) to remove amino-terminal N-linked glycosylation. The purified receptor was routinely analysed by SDS–PAGE/Coomassie brilliant blue staining (see Supplementary Fig. 8a).

Bovine Gαs short, His6-rat Gβ1, and bovine Gγ2 were expressed in HighFive insect cells (Invitrogen) grown in Insect Xpress serum-free media (Lonza). Cultures were grown to a density of 1.5 million cells per ml and then infected with three separate Autographa californica nuclear polyhedrosis viruses each containing the gene for one of the G protein subunits at a 1:1 multiplicity of infection (the viruses were a gift from A. Gilman). After 40–48 h of incubation the infected cells were harvested by centrifugation and resuspended in 75 ml lysis buffer (50 mM HEPES, pH 8.0, 65 mM NaCl, 1.1 mM MgCl2, 1 mM EDTA, 1× PTT (35 µg ml⁻¹ phenylmethanesulphonyl fluoride, 32 µg ml⁻¹ tosyl phenylalanyl chloromethyl ketone, 32 µg ml⁻¹ tosyl lysyl chloromethyl ketone), 1× LS (3.2 µg ml⁻¹ leupeptin and 3.2 µg ml⁻¹ soybean trypsin inhibitor), 5 mM β-mercaptoethanol (β-ME), and 10 µM GDP) per litre of culture volume. The suspension was pressurized with 600 p.s.i. N2 for 40 min in a nitrogen cavitation bomb (Parr Instrument Company). After depressurization, the lysate was centrifuged to remove nuclei and unlysed cells, and then ultracentrifuged at 180,000g for 40 min. The pelleted membranes were resuspended in 30 ml wash buffer (50 mM HEPES, pH 8.0, 50 mM NaCl, 100 µM MgCl2, 1× PTT, 1× LS, 5 mM β-ME, 10 µM GDP) per litre culture volume using a Dounce homogenizer and centrifuged again at 180,000g for 40 min. The washed pellet was resuspended in a minimal volume of wash buffer and flash-frozen with liquid nitrogen.

The frozen membranes were thawed and diluted to a total protein concentration of 5 mg ml⁻¹ with fresh wash buffer. Sodium cholate detergent was added to the suspension at a final concentration of 1.0%, MgCl2 was added to a final concentration of 5 mM, and 0.05 mg of purified protein phosphatase 5 (prepared in house) was added per litre of culture volume. The sample was stirred on ice for 40 min, and then centrifuged at 180,000g for 40 min to remove insoluble debris. The supernatant was diluted fivefold with Ni-NTA load buffer (20 mM HEPES, pH 8.0, 363 mM NaCl, 1.25 mM MgCl2, 6.25 mM imidazole, 0.2% Anzergent 3-12, 1× PTT, 1× LS, 5 mM β-ME, 10 µM GDP), taking care to add the buffer slowly to avoid dropping the cholate concentration below its critical micelle concentration too quickly. Ni-NTA resin (3 ml; Qiagen) pre-equilibrated in Ni-NTA wash buffer 1 (20 mM HEPES, pH 8.0, 300 mM NaCl, 2 mM MgCl2, 5 mM imidazole, 0.2% cholate, 0.15% Anzergent 3-12, 1× PTT, 1× LS, 5 mM β-ME, 10 µM GDP) per litre culture volume was added and the sample was stirred on ice for 20 min. The resin was collected into a gravity column and washed with 4× column volumes of Ni-NTA wash buffer 1, Ni-NTA wash buffer 2 (20 mM HEPES, pH 8.0, 50 mM NaCl, 1 mM MgCl2, 10 mM imidazole, 0.15% Anzergent 3-12, 0.1% DDM, 1× PTT, 1× LS, 5 mM β-ME, 10 µM GDP), and Ni-NTA wash buffer 3 (20 mM HEPES, pH 8.0, 50 mM NaCl, 1 mM MgCl2, 5 mM imidazole, 0.1% DDM, 1× PTT, 1× LS, 5 mM β-ME, 10 µM GDP). The protein was eluted with Ni-NTA elution buffer (20 mM HEPES, pH 8.0, 40 mM NaCl, 1 mM MgCl2, 200 mM imidazole, 0.1% DDM, 1× PTT, 1× LS, 5 mM β-ME, 10 µM GDP). Protein-containing fractions were pooled and MnCl2 was added to a final concentration of 100 µM. Purified lambda protein phosphatase (50 µg; prepared in house) was added per litre of culture volume and the eluate was incubated on ice with stirring for 30 min. The eluate was passed through a 0.22-µm filter and loaded directly onto a MonoQ HR 16/10 column (GE Healthcare) equilibrated in MonoQ buffer A (20 mM HEPES, pH 8.0, 50 mM NaCl, 100 µM MgCl2, 0.1% DDM, 5 mM β-ME, 1× PTT). The column was washed with 150 ml buffer A at 5 ml min⁻¹ and bound proteins were eluted over 350 ml with a linear gradient up to 28% MonoQ buffer B (same as buffer A except with 1 M NaCl). Fractions were collected in tubes spotted with enough GDP to make a final concentration of 10 µM. The Gs-containing fractions were concentrated to 2 ml using a stirred ultrafiltration cell (Amicon) with a 10-kDa nominal molecular weight limit (NMWL) regenerated cellulose membrane (Millipore). The concentrated sample was run on a Superdex 200 prep grade XK 16/70 column (GE Healthcare) equilibrated in S200 buffer (20 mM HEPES, pH 8.0, 100 mM NaCl, 1.1 mM MgCl2, 1 mM EDTA, 0.012% DDM, 100 µM TCEP, 2 µM GDP). The fractions containing pure Gs were pooled, glycerol was added to 10% final concentration, and then the protein was concentrated to at least 10 mg ml⁻¹ using a 30 kDa MWCO centrifugal ultrafiltration device (Millipore). The concentrated sample was then aliquoted, flash frozen, and stored at −80 °C. A typical yield of final, purified Gs heterotrimer from 8 l of cell culture volume was 6 mg.
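
The dilution and gradient steps in this purification can be checked with simple volume-weighted mixing arithmetic. The sketch below is illustrative and rests on stated assumptions: "diluted fivefold" is read as one volume of supernatant plus four volumes of load buffer, the supernatant is taken to contain the wash-buffer NaCl (50 mM), the added MgCl2 (5 mM) and cholate (1.0%), and small volume changes from those additions are ignored. The helper `mix` is a hypothetical name.

```python
def mix(parts):
    """Volume-weighted concentration of one solute after mixing.

    parts: list of (volume, concentration) tuples in consistent units.
    """
    total_volume = sum(v for v, _ in parts)
    return sum(v * c for v, c in parts) / total_volume

# Fivefold dilution, read here as 1 volume of supernatant plus 4 volumes of load buffer (assumption).
nacl_mM = mix([(1, 50.0), (4, 363.0)])
mgcl2_mM = mix([(1, 5.0), (4, 1.25)])
imidazole_mM = mix([(1, 0.0), (4, 6.25)])
cholate_pct = mix([(1, 1.0), (4, 0.0)])

# MonoQ gradient endpoint: 28% buffer B (1 M NaCl) in buffer A (50 mM NaCl).
gradient_end_nacl_mM = mix([(72, 50.0), (28, 1000.0)])
```

Under these assumptions the diluted sample comes out at about 300 mM NaCl, 2 mM MgCl2, 5 mM imidazole and 0.2% cholate, which lines up with the composition listed for Ni-NTA wash buffer 1, and the MonoQ gradient ends at about 316 mM NaCl.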

Nanobody-35 (Nb35) was expressed in the periplasm of E. coli strain WK6, extracted, and purified by nickel affinity chromatography according to previously described methods followed by ion-exchange chromatography (Supplementary Fig. 9a) using a Mono S 10/100 GL column (GE Healthcare). Selected Nb35 fractions were dialysed against buffer (10 mM HEPES, pH 7.5, 100 mM NaCl) and concentrated to approximately 65 mg ml⁻¹ with a 10 kDa MWCO Millipore concentrator.
Complex formation, stabilization and purification. Formation of a stable complex (see Supplementary Fig. 10) was accomplished by mixing Gs heterotrimer at approximately 100 µM concentration with BI-167107-bound T4L-β2AR (or β2AR-365) in molar excess (approximately 130 µM) in 2 ml buffer (10 mM HEPES, pH 7.5, 100 mM NaCl, 0.1% DDM, 1 mM EDTA, 3 mM MgCl2, 10 µM BI-167107) and incubating for 3 h at room temperature. BI-167107, which was identified from screening and characterizing approximately 50 different β2AR agonists (data not shown), has a dissociation half-time of approximately 30 h, providing a higher degree of stabilization to the active G protein-bound receptor than other full agonists such as isoproterenol. To maintain the high-affinity nucleotide-free state of the complex, apyrase (25 mU ml⁻¹, NEB) was added after 90 min to hydrolyse residual GDP released from Gαs upon binding to the receptor. GMP resulting from hydrolysis of GDP by apyrase has very poor affinity for the G protein in the complex. Rebinding of GDP can cause dissociation of the β2AR–Gs complex (Supplementary Fig. 1a).
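
To give a feel for what a dissociation half-time of approximately 30 h means on the timescale of the 3 h incubation, the half-time can be converted to a rate constant. This is a rough sketch only: it assumes simple first-order dissociation with no rebinding, which does not hold in the actual buffer (it contains 10 µM BI-167107), and the function name `fraction_bound` is hypothetical.

```python
import math

half_time_h = 30.0  # dissociation half-time of BI-167107 (from the text)

# First-order rate constant implied by the half-time (assumption: single-exponential decay).
k_off_per_h = math.log(2) / half_time_h

def fraction_bound(t_h):
    """Fraction of ligand still bound after t_h hours, assuming first-order
    dissociation and no rebinding."""
    return math.exp(-k_off_per_h * t_h)

after_incubation = fraction_bound(3.0)  # the 3 h complex-formation step
```

Even in this worst case without rebinding, more than 90% of the receptor would still carry agonist at the end of the incubation.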

The β2AR–Gs complex in DDM shows significant dissociation after 48 h at 4 °C (Supplementary Fig. 11a). We screened and characterized over 50 amphiphiles (data not shown) and identified MNG-3 (refs 12, 13; NG-310, Affymetrix-Anatrace) and its closely related analogues as detergents that substantially stabilize the complex (Supplementary Fig. 11a). The complex was exchanged into MNG-3 by adding the β2AR–Gs mixture (2 ml) to 8 ml buffer (20 mM HEPES, pH 7.5, 100 mM NaCl, 10 µM BI-167107) containing 1% MNG-3 for 1 h at room temperature.

At this stage the mixture contains the β2AR–Gs complex, non-functional Gs, and an excess of β2AR. To separate functional β2AR–Gs complex from non-functional Gs, and to complete the detergent exchange, the β2AR–Gs complex was immobilized on M1 Flag resin and washed in buffer (20 mM HEPES, pH 7.5, 100 mM NaCl, 10 µM BI-167107, and 3 mM CaCl2) containing 0.2% MNG-3. To prevent cysteine bridge-mediated aggregation of β2AR–Gs complexes, 100 µM TCEP was added to the eluted protein before concentrating it with a 50 kDa MWCO Millipore concentrator. Of note, it was discovered later that crystal growth improved at even higher TCEP concentrations (above 1 mM) compared to 100 µM TCEP, and that the integrity of the β2AR–Gs complex in MNG-3 was stable to 10 mM TCEP as measured by gel filtration analysis (Supplementary Fig. 12). In contrast, DDM-solubilized β2AR loses its ability to bind the high-affinity antagonist [³H]dihydroalprenolol ([³H]DHA) in 10 mM TCEP (data not shown), probably due to disruption of extracellular disulphide bonds. Iodoacetamide could not be used to block reactive cysteines on Gs α and β subunits as it caused dissociation of the β2AR–Gs complex (Supplementary Fig. 12b). The final size exclusion chromatography procedure to separate excess free receptor from the β2AR–Gs complex (Supplementary Fig. 8b) was performed on a Superdex 200 10/300 GL column (GE Healthcare) equilibrated with buffer containing 0.02% MNG-3, 10 mM HEPES, pH 7.5, 100 mM NaCl, 10 µM BI-167107 and 100 µM TCEP. Peak fractions were pooled (Supplementary Fig. 8b) and concentrated to approximately 90 mg ml⁻¹ with a 100 kDa MWCO Viva-spin concentrator and analysed by SDS–PAGE/Coomassie brilliant blue staining (Supplementary Fig. 8a) and gel filtration (Supplementary Fig. 8c). To confirm a pure, homogeneous and dephosphorylated preparation, the β2AR–Gs complex was routinely analysed by ion exchange chromatography (Supplementary Fig. 8d).

Protein engineering. To increase the probability of obtaining crystals of the β2AR–Gs complex we set out to increase the polar surface area on the extracellular side of the receptor using two strategies. The first approach, to generate extracellular binding antibodies, was not successful. The second approach was to replace the flexible and presumably unstructured amino terminus with the globular protein T4 lysozyme (T4L) used previously to crystallize and solve the carazolol-bound receptor. The construct used here (T4L-β2AR) contained the cleavable signal sequence followed by the M1 Flag epitope (DYKDDDDA), the Tobacco etch virus (TEV) protease recognition sequence (ENLYFQG), bacteriophage T4 lysozyme from N2 through Y161 including C54T and C97A mutations, and a two residue alanine linker fused to the human β2AR sequence D29 through G365. The PNGaseF-inaccessible glycosylation site of the β2AR at N187 was mutated to Glu. M96 and M98 in the first extracellular loop were each replaced by Thr to increase the otherwise low expression level of T4L-β2AR. The threonine mutations did not affect ligand binding affinity for [³H]DHA, but caused a small, approximately twofold decrease in affinity for isoproterenol (data not shown).

The β2AR–Gs peptide fusion construct used for [³H]DHA competition binding with isoproterenol was constructed from the receptor truncated at position 365 and fused to the last 21 amino acids of the Gαs subunit (amino acids 374–394, except for C379A). A Gly-Ser is inserted between the receptor and the peptide. Also an extended TEV protease site (SENLYFQGS) was introduced in the β2AR between G360 and G361.
Stabilization of Gs with nanobodies. From negative stain electron microscopy imaging, we observed that the α-helical domain of Gαs was flexible and therefore possibly responsible for poor crystal quality. Targeted stabilization of this domain was addressed by immunizing two llamas (Lama glama) with the bis(sulphosuccinimidyl)glutarate (BS2G, Pierce) cross-linked β2AR–Gs–BI-167107 ternary complex. Peripheral blood lymphocytes were isolated from the immunized animals to extract total RNA, prepare cDNA and construct a Nanobody phage display library according to published methods. Nb35 and Nb37 were enriched by two rounds of biopanning on the β2AR–Gs–BI-167107 ternary complex embedded in biotinylated high-density lipoprotein particles. Nb35 and Nb37 were selected for further characterization because they bind the β2AR–Gs–BI-167107 ternary complex but not the free receptor in an ELISA assay. Nanobody binding to the β2AR–Gs complex was confirmed by size exclusion chromatography (Supplementary Fig. 1d), and it was noted that both nanobodies protected the complex from dissociation by GTPγS, suggestive of a stabilizing Gs–Nb interaction (Supplementary Fig. 1d).
Crystallization. BI-167107-bound T4L-β2AR–Gs complex and Nb35 were mixed in 1:1.2 molar ratio. The small molar excess of Nb35 was verified by analytical gel filtration (see Supplementary Fig. 9b). The mixture was incubated for 1 h at room temperature before mixing with 7.7 MAG (provided by M. Caffrey) containing 10% cholesterol (C8667, Sigma) in 1:1 protein solution to lipid ratio (w/w) using the twin-syringe mixing method reported previously. The concentration of T4L-β2AR–Gs–Nb35 complex in 7.7 MAG was approximately 25 mg ml⁻¹. We believe the detergent MNG-3 stabilizes the T4L-β2AR–Gs complex during its incorporation into the lipid cubic phase. This may be due to the high affinity of MNG-3 for the receptor. The β2AR in MNG-3 maintains its structural integrity even when diluted below the CMC of the detergent, in contrast to β2AR in DDM, which rapidly loses binding activity (Supplementary Fig. 11b). Moreover, MNG-3 improved crystal size and quality, as previously reported. The protein:lipid mixture was delivered through an LCP dispensing robot (Gryphon, Art Robbins Instruments) in 40 nl drops to either 24-well or 96-well glass sandwich plates and overlaid en-bloc with 0.8 µl precipitant solution. Multiple crystallization leads were initially identified using in-house screens partly based on reagents from the StockOptions Salt kit (Hampton Research). Crystals for data collection were grown in 18 to 22% PEG 400, 100 mM MES, pH 6.5 (Supplementary Fig. 1c), 350 to 450 mM potassium nitrate, 10 mM foscarnet (Supplementary Fig. 1b), 1 mM TCEP (Supplementary Fig. 12c), and 10 µM BI-167107. Crystals reached full size within 3–4 days at 20 °C and were picked from a sponge-like mesophase and flash-frozen in liquid nitrogen without additional cryoprotectant.
Microcrystallography data collection and processing. Diffraction data were measured at the Advanced Photon Source beamline 23ID-B. Hundreds of crystals were screened, and a final data set was compiled using diffraction wedges of typically 10 degrees from 20 strongly diffracting crystals (Supplementary Table 2). All data reduction was performed using HKL2000 (ref. 40). Although in many cases diffraction to beyond 3 Å was seen in initial frames, radiation damage and anisotropic diffraction resulted in low completeness in higher resolution shells. Analysis of the final data set by the UCLA diffraction anisotropy server indicated that diffraction along the a* axis was superior to that in other directions. On the basis of an F/σ(F) cutoff of 3 along each reciprocal space axis, reflections were subjected to an anisotropic truncation with resolution limits of 2.9, 3.2 and 3.2 Å along a*, b* and c* before use in refinement. We report this structure to an overall resolution of 3.2 Å. Despite the low completeness in the highest resolution shells (Supplementary Table 3) inclusion of these reflections gave substantial improvements in map quality and lower Rfree during refinement.
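
One common way to apply direction-dependent resolution limits like those above is an ellipsoidal cutoff in reciprocal space. The sketch below illustrates that idea only; the text does not state the exact procedure used, the function name `keep_reflection` is hypothetical, and the reciprocal axes are treated as mutually orthogonal for simplicity, which need not be true for the actual crystal form.

```python
def keep_reflection(s_a, s_b, s_c, limits=(2.9, 3.2, 3.2)):
    """Ellipsoidal resolution cutoff.

    s_a, s_b, s_c: components of the scattering vector along a*, b*, c*
    in 1/Å (treated as mutually orthogonal, an assumption).
    limits: resolution limits in Å along a*, b*, c* (values from the text).
    Returns True if the reflection lies inside the ellipsoid.
    """
    d_a, d_b, d_c = limits
    return (s_a * d_a) ** 2 + (s_b * d_b) ** 2 + (s_c * d_c) ** 2 <= 1.0

# A 3.0 Å reflection survives along a* but is cut along b*.
along_a = keep_reflection(1 / 3.0, 0.0, 0.0)
along_b = keep_reflection(0.0, 1 / 3.0, 0.0)
```

With these limits, data between 3.2 and 2.9 Å are retained only near the better-diffracting a* direction, which is why the highest resolution shells have low completeness.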

Structure solution and refinement. The structure was solved by molecular replacement using Phaser. The order of the molecular replacement search was found to be critical in solving the structure. In order, the search models used were: the β and γ subunits from a Gi heterotrimer (PDB ID: 1GP2), the Gs α Ras-like domain (PDB ID: 1AZT), the active-state β2AR (PDB ID: 3P0G), a β2AR-binding nanobody (PDB ID: 3P0G), T4 lysozyme (PDB ID: 2RH1), and the Gs α-helical domain (PDB ID: 1AZT). Following the determination of the initial structure by molecular replacement, rigid body refinement and simulated annealing were performed in Phenix and BUSTER, followed by restrained refinement and manual rebuilding in Coot. After iterative refinement and manual adjustments, the structure was refined in CNS using the DEN method. Although the resolution of this structure exceeds that for which DEN is typically most useful, the presence of several poorly resolved regions indicated that the incorporation of additional information to guide refinement could provide better results. The DEN reference models used were those used for molecular replacement, with the exception of Nb35, which was well ordered and for which no higher resolution structure is available. Side chains were omitted from 53 residues for which there was no electron density past Cβ below a low contour level of 0.7σ in a 2Fo − Fc map. Figures were prepared using PyMOL (The PyMOL Molecular Graphics System, Version 1.3, Schrödinger, LLC.). MolProbity was used to determine Ramachandran statistics.

Competition binding. Membranes expressing the β2AR or the β2AR–Gs peptide fusion were prepared from baculovirus-infected Sf9 cells and [³H]DHA-binding performed as previously described. For competition binding, membranes were incubated with [³H]DHA (1.1 nM final) and increasing concentrations of (−)-isoproterenol for 1 h before harvesting onto GF/B filters. Competition data were fitted to a two-site binding model and isoproterenol high and low Ki values and fractions calculated using GraphPad Prism.
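
For readers unfamiliar with the two-site analysis mentioned above, the model can be written down in a few lines. This is a minimal sketch of a standard two-site competition curve together with the Cheng-Prusoff conversion from IC50 to Ki; it is not the fitting software's implementation, the function names are hypothetical, and any IC50 or radioligand Kd values passed in are placeholders because the text does not give them.

```python
def two_site_competition(log_conc, frac_high, log_ic50_high, log_ic50_low):
    """Fraction of radioligand binding remaining (0..1) at a competitor concentration.

    log_conc: log10 of competitor concentration (M).
    frac_high: fraction of receptors in the high-affinity state.
    log_ic50_high, log_ic50_low: log10 IC50 of each site (M).
    """
    high = frac_high / (1.0 + 10.0 ** (log_conc - log_ic50_high))
    low = (1.0 - frac_high) / (1.0 + 10.0 ** (log_conc - log_ic50_low))
    return high + low

def cheng_prusoff_ki(ic50, radioligand_conc, radioligand_kd):
    """Convert an IC50 to Ki with the Cheng-Prusoff relation (same units throughout)."""
    return ic50 / (1.0 + radioligand_conc / radioligand_kd)

# Example with the high-affinity fraction reported for the peptide fusion (0.68)
# and placeholder IC50 values of 1 nM and 1 µM (hypothetical).
remaining = two_site_competition(-9.0, 0.68, -9.0, -6.0)
```

In practice the three free parameters would be estimated by nonlinear least squares against the measured displacement data, and the fitted `frac_high` corresponds to the high-affinity fractions quoted in the Figure 6 legend.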
Crystal structure of nucleotide-free dynamin


Katja Faelber, York Posor, Song Gao, Martin Held, Yvette Roske, Dennis Schulze, Volker Haucke, Frank Noé & Oliver Daumke


Dynamin is a mechanochemical GTPase that oligomerizes around the neck of clathrin-coated pits and catalyses vesicle 
scission in a GTP-hydrolysis-dependent manner. The molecular details of oligomerization and the mechanism of the 
mechanochemical coupling are currently unknown. Here we present the crystal structure of human dynamin 1 in the 
nucleotide-free state with a four-domain architecture comprising the GTPase domain, the bundle signalling element, 
the stalk and the pleckstrin homology domain. Dynamin 1 oligomerized in the crystals via the stalks, which assemble in a 
criss-cross fashion. The stalks further interact via conserved surfaces with the pleckstrin homology domain and the 
bundle signalling element of the neighbouring dynamin molecule. This intricate domain interaction rationalizes a 
number of disease-related mutations in dynamin 2 and suggests a structural model for the mechanochemical 
coupling that reconciles previous models of dynamin function. 


Dynamin, the founding member of the dynamin superfamily, is a 
100-kDa mechanochemical enzyme (Fig. 1a) involved in the scission 
of clathrin-coated vesicles from the plasma membrane. The brain-specific 
isoform dynamin 1 mediates uptake of synaptic vesicles in 
presynaptic terminals, whereas a function of dynamin 3 at the 
postsynaptic density has been described. Dynamin 2 is ubiquitously 
expressed, and mutations in its middle domain (MD), pleckstrin 
homology (PH) domain and GTPase effector domain (GED) are linked 
to human diseases, for example, rare forms of centronuclear myopathy 
and Charcot-Marie-Tooth peripheral neuropathy. Upon recruitment 
via the carboxy-terminal proline-rich domain (PRD), dynamin 
oligomerizes into helical structures around the neck of budding vesicles and 
catalyses vesicle scission in a GTP-hydrolysis-dependent manner. 
Different mechanisms for the scission reaction have been proposed, 
including GTP-hydrolysis-dependent constriction, extension and 
twisting of the vesicle neck. Other models suggest that the GTP-bound 
dynamin oligomer induces hemifusion of the inner membrane leaflet 
followed by complete membrane scission after GTP-hydrolysis-dependent 
release. To resolve the detailed molecular mechanism, 
high-resolution structural data for full-length dynamin are required 
which, to date, are available only for the isolated PH domain and 
the GTPase (G) domain. Low-resolution electron microscopy 
reconstructions of dynamin oligomers showed that nucleotide binding 
leads to constriction of helical assemblies through rearrangements in 
the stalk region composed of the MD and GED. Furthermore, 
G domain dimerization via a conserved interface across the 
nucleotide-binding site was shown to mediate the stimulated GTPase activity. 
We recently described the structure of the stalk of the dynamin-like 
antiviral myxovirus resistance protein 1 (MxA) GTPase and elucidated 
its mode of oligomerization, which involves three distinct interfaces and 
two loop regions in the stalk. Using this information, we succeeded in 
determining the structure of dynamin 1. 


Figure 1 | Structure of nucleotide-free human dynamin 1. a, Structure-based domain 
architecture of human dynamin 1. The classical domain assignment is indicated below. 
b, Ribbon-type representation of human dynamin 1. Regions not resolved in the crystal 
structure are indicated by dotted lines. Domains, distinct secondary structure elements 
and N and C termini are labelled. Lipid-binding residues are indicated as o. 


The structure of human dynamin 1 


We reasoned that the propensity of dynamin to oligomerize at high 
protein concentrations might interfere with the formation of protein 
crystals. On the basis of our previous MxA study, we assayed a 
number of mutants in a human dynamin 1 construct (amino acids 6-746, 
Fig. 1a) for oligomerization defects. Indeed, a five-amino-acid exchange 
(IHGIR395-399AAAAA) in a conserved motif mapping to loop L2 of 
the MxA stalk interfered with higher-order assembly and resulted in a 
monodisperse dimeric dynamin 1 species (Supplementary Fig. 1, see 
also ref. 21). Crystals of a construct containing additionally the K562E 
mutation were obtained in the absence of nucleotides and diffracted to 
a maximal resolution of 3.7 Å (Supplementary Table 1). The structure 
was solved by molecular replacement and refined to Rwork/Rfree of 
28.4%/33.5% (Supplementary Table 1). To verify the sequence, 
the positions of 19 internal methionines were assigned by a single 
anomalous dispersion approach (Supplementary Fig. 2). 

Dynamin 1(ΔPRD) has a four-domain architecture, composed of the 
G domain, the bundle signalling element (BSE), the stalk and the PH 
domain (annotated as superscript G, B, S and P, respectively), which does 
not strictly follow the sequence-derived domain boundaries (Fig. 1 and 
Supplementary Fig. 3). The structure of the amino-terminal G domain is 
very similar to that of the isolated nucleotide-free G domain 
(root-mean-square deviation (r.m.s.d.) of 1.4 Å for 287 Cα atoms) and shows 
a curved central β-sheet surrounded by α-helices at both sides. The two 
switch regions known to mediate nucleotide-dependent conformational 
changes and the cis stabilizing loop, involved in G domain dimerization, 
are partly disordered. At the N and C termini of the G domain, 
helices α1^B and α2^B, together with α3^B from the C-terminal part of the 
GED of the same molecule, form a three-helix bundle, the BSE (Fig. 1 
and Supplementary Fig. 4). Compared to the previously described G 
domain-BSE construct in the GDP•AlF₄⁻-bound form, α1^B is shifted 
by two turns relative to α2^B/α3^B (Supplementary Figs 2 and 4), whereas it 
interacts with the G domain in a similar fashion in both structures. 

At the C-terminal end of α2^B, the BSE connects to the stalk of 
dynamin 1. The stalk is composed of a four-helix bundle where three 
helices, α1^S-α3^S, are provided by the MD and α4^S by the N-terminal 
part of the GED (Fig. 1 and Supplementary Fig. 5a). α1^S in dynamin 1 is 
subdivided into α1N^S, α1M^S and α1C^S by two disordered loops, L1N^S 
and L1C^S, compared to a single break of the corresponding helix in 
MxA. Furthermore, α3^S in dynamin 1 is extended by a highly 
conserved loop L4^S. At the C terminus of the stalk, α4^S closely packs against 
α1^S-α3^S via hydrophobic contacts and leads the polypeptide chain 
back to the BSE. Despite an overall sequence identity of only 16%, 
the architecture of the dynamin 1 and MxA stalk is remarkably similar 
(r.m.s.d. of 2.6 Å for 160 aligned Cα atoms, Supplementary Fig. 5). 

The PH domain is interconnected between α3^S and α4^S of the stalk 
by two disordered loops, L1 and L2 of the stalk-PH domain connection, and 
shows only minor deviations from the isolated PH domain of dynamin 1 
(refs 14, 15; r.m.s.d. of 0.8 Å for 102 Cα atoms). The three lipid binding 
loops are only partially resolved and point towards the solvent (Fig. 1b and 
Supplementary Fig. 3). 


Dynamin assembly via the stalk 

Similarly to the MxA stalks, the dynamin 1 stalks in the crystals were 
arranged in a criss-cross fashion resulting in a linear stalk filament. 
The highly conserved symmetric interface-2 of 1,200 Å² is located in 
the centre of the stalk (according to the MxA nomenclature, Fig. 2 and 
Supplementary Figs 3 and 5). Assembly via this interface results in an 
extended dynamin dimer that serves as building block for dynamin 
oligomers. Shape and dimension of this dimer agree well with a 
small-angle X-ray scattering study. 


Figure 2 | The dynamin 1 dimer. In the crystals, stalks were arranged in a 
criss-cross fashion via crystallographic two-fold axis (black dotted lines). 
Assembly via the central interface-2 leads to an extended dynamin 1 dimer. 
Black rectangles indicate stalk interfaces shown in detail in the insets (see 
Supplementary Fig. 2 for PH domain assignment). Disease-related dynamin 2 
mutations causing centronuclear myopathy or Charcot-Marie-Tooth are 
represented by pink spheres, with the amino acid exchange indicated. 

We previously showed that a second hydrophobic interface in the 
MxA stalk, interface-1, mediates assembly of higher-order oligomers. 
In dynamin 1, however, the stalks do not contact each other directly at 
the predicted interface-1 (minimal distance 4.5 Å, Supplementary 
Fig. 5). This difference is caused by a 5° tilt of the dynamin 1 stalks 
relative to the stalk axis. The hydrophobic nature of this surface in 
dynamin 1 and its conservation in the dynamin family (Supplementary 
Fig. 5c) are indicative of a similar function as in MxA as an 
oligomerization site. Closure of interface-1 might induce a pitch in the dynamin 
assemblies leading to helical oligomers rather than to ring-like 
structures as in MxA. 

L2^S containing the disruptive IHGIR395-399AAAAA mutation 
and L1N^S are not ordered in the linear dynamin 1 oligomer 
(Supplementary Fig. 5). The corresponding loops in the MxA stalk 
form a third interface (interface-3) which also contributes to the 
assembly of oligomers (Fig. 2). Accordingly, mutations in both loops 
in dynamin 1 (ref. 21), MxA and in L1 of dynamin 1-like protein 
prevent oligomerization. Interestingly, Ser 347 and Tyr 354 in loop 
L1N^S in dynamin 1 are phosphorylated in vivo and might control 
the assembly status. 


Interactions of the BSE 


The BSE interacts with the central β-sheet of the G domain via a mostly 
hydrophilic interface of 1,100 Å² (Supplementary Figs 4 and 6). In 
contrast, the BSE and concomitantly the G domain are only loosely 
associated with the stalk of the same molecule via loops L1^BS and L2^BS 
constituting a flexible hinge (hinge 1), as observed in other 
dynamin-related proteins (Figs 1b and 2, and Supplementary Fig. 2). 
Interestingly, Asp 744 at the C terminus of α3^B of the BSE contacts 
Arg 440 in α2^S of the neighbouring, parallel dynamin 1 stalk (Fig. 3a). 
A similar intermolecular interaction mediates oligomerization and 
the antiviral function of MxA (S.G., K.F., O.D., unpublished 
observation). We tested the importance of this contact experimentally. The 
wild-type dynamin 1 construct bound efficiently to liposomes, resulting 
in an approximately 200-fold stimulation of GTPase activity 
(Supplementary Fig. 7). The single R440A and D744A mutants behaved as 
wild type in these assays. To analyse the role of these residues for 
clathrin-mediated endocytosis, dynamin 2-eGFP mutants (a fusion of 
dynamin 2 with enhanced green fluorescent protein) were re-expressed 
in HeLa cells depleted of endogenous dynamin 2. Both R440A and 
D744A mutants localized to the plasma membrane similarly to wild-type 
dynamin 2 (Supplementary Fig. 8), but transferrin internalization was 
increased (Fig. 3b and Supplementary Fig. 9). Thus, the salt bridge has 
an inhibitory and/or control function in dynamin-based endocytosis. 


The stalk-PH domain interface 


α1M^S of the stalk forms a conserved surface of 370 Å² with the PH domain 
(Figs 1b and 2, and Supplementary Figs 3 and 5c). Interestingly, 19 unique 
mutations causing centronuclear myopathy or Charcot-Marie-Tooth 
disease cluster in the stalk or the PH domain of dynamin 2, but none 
localizes to the G domain or BSE (Fig. 2). For example, mutations 
E368K/Q and R369W/Q in the stalk and A618T and S619L/W in 
the PH domain are directly in or in close vicinity to the interface 
between the two domains. Mutations A618T and S619L/W lead to 
increased oligomerization rates of dynamin in solution, suggesting 
that the stalk-PH interface controls oligomerization. In gel filtration, 
the disease-related stalk mutant E368K also eluted as high molecular 
weight species. Consistently, this mutant showed a 20-fold increased 
basal GTPase rate, whereas the liposome-stimulated GTPase reaction 
and transferrin uptake were unchanged (Fig. 3b and Supplementary 
Fig. 7). In contrast, the R369W mutant behaved as wild-type dynamin 
in biochemical and endocytosis assays. Mutations at the periphery of 
this interface (S619L, L621D) also did not compromise dynamin 
2-based endocytic activity (Fig. 3b), indicating that more subtle 
changes lead to the disease phenotype. Interestingly, the F372D 
mutant in the centre of the interface showed significantly higher 
transferrin uptake compared to wild-type dynamin 2, pointing also to an 
inhibitory and/or control function of this interface for dynamin-based 
endocytosis. 


Figure 3 | Stalk interactions with the BSE and PH domain. a, Top view on 
the dynamin 1 oligomer. The PH domains are not drawn for clarity. The inset 
shows a close view of the intermolecular BSE-stalk interaction. b, HeLa cells 
depleted of endogenous dynamin 2 by short interfering RNA (siRNA) were 
transfected with a plasmid encoding siRNA-resistant dynamin 2-eGFP and 
allowed to endocytose fluorescently labelled transferrin. In transfected cells, 
fluorescence was quantified and normalized to mock-treated cells expressing 
eGFP. Data shown represent mean ± standard error; *P < 0.05; **P < 0.01; 
***P < 0.0001 for wild-type versus mutant dynamin 2-eGFP, as determined 
by t-test. 


Conformational changes during dynamin assembly 


Based on the isolated BSE and PH domain (Fig. 1b), the stalk dimer 
(Fig. 2), the GDP•AlF₄⁻-bound G domain dimer, and electron 
microscopy reconstructions of oligomerized dynamin 1 in the 
constricted state, we generated a molecular model for self-assembly of 
dynamin into helical structures (Fig. 4a and Supplementary Fig. 10). 
The resulting dynamin dimer spans a length of more than 260 Å and 
covers 95° of a dynamin ring, thereby placing G domain and BSE 
above the neighbouring stalk. According to this model, 
oligomerization of dynamin proceeds along the central stalk, whereas the G 
domains mediate contacts between neighbouring turns (see ref. 20 
for a similar MxA model). 

When comparing the linear structure of dynamin 1 in the crystal 
with the helical assembly, large-scale domain movements are apparent 
(Supplementary Fig. 11). The G domain-BSE unit is shifted and rotated 
around hinge 1 upon oligomerization. Additionally, the G domain 
separates from the BSE by a rotation around the invariant Pro 32 
and Pro 294 (Supplementary Fig. 6). The corresponding residues 
Gly 68 and Gly 309 in BDLP and Pro 342 in atlastin have also 
been suggested to act as a hinge (hinge 2). Integrity of the G domain-BSE 
interface is crucial for the function of dynamin, as indicated by the 
aggregation of the I10D interface mutant in vivo (Supplementary Figs 6 
and 8). Furthermore, I10D behaved as a dominant-negative mutant in 
transferrin uptake assays (Fig. 3b). Moreover, the QD17-18AA 
mutation rendered dynamin 2 largely inactive in endocytosis assays, despite 
localizing correctly to clathrin-coated pits at the plasma membrane 
(Fig. 3b and Supplementary Fig. 8). 


Figure 4 | Model for dynamin oligomerization and function. a, Model of the 
oligomerized dynamin helix in the constricted state, in top and side view (see 
also Supplementary Fig. 10). Three dimers (1-3) are uniformly coloured. 
Whereas 13 stalk dimers complete one turn, the G domain of dimer (i) 
associates with the G domain of dimer (i + 10). b, Structure-based illustration 
for the proposed mechanism of the dynamin oligomer. Variations in the 
assembly of consecutive dynamin molecules lead to dynamin helices of 
different rise and diameter. For further explanation, see main text and 
Supplementary Fig. 12. 

The PH domains also undergo a pronounced rearrangement, to a 
position below the stalk, with the lipid-binding loops oriented 
towards the membrane (Supplementary Fig. 11a). We suggest that the 
stalk-PH domain interface is disrupted by binding of the PH domain to 
high-affinity, phosphatidylinositol-4,5-bisphosphate-containing 
membranes such as the plasma membrane, thereby promoting dynamin 
oligomerization. 

Finally, we suggest that rotation of stalk dimers via the flexible 
interface-1 and/or interface-3, leaving interface-2 unchanged, leads 
to bending of the linear oligomer and allows helix formation 
(Supplementary Figs 11b and 12). Interestingly, the IHGIR395-399AAAAA 
mutation in interface-3 prevented liposome binding 
and, consequently, the liposome-stimulated GTPase activity 
(Supplementary Fig. 7). This mutant further behaved in a 
dominant-negative fashion in transferrin uptake assays (Fig. 3b) and displayed 
a diffuse localization and reduced recruitment to clathrin-coated pits 
(Supplementary Fig. 8). These results point to the central role of 
interface-3 for the function of dynamin. 


Discussion 

The present work, combined with prior studies (refs 18-20, 38), suggests a 
structural model for the mechanochemical coupling in dynamin that is 
consistent with previous models. Our structural analysis indicates 
that the diameter of helical dynamin assemblies is controlled by the 
angle between two stalk dimers (Supplementary Figs 11b and 12f-h). 
We suggest that this angle is adjusted in response to GTP-dependent 
dimerization of the G domains: a relaxed conformation of interface-1 
is adopted in the absence of G domain constraints, whereas GTP- 
triggered dimerization of the G domains constrains rotation in inter- 
face-1 and induces a bent, constricted conformation (Supplementary 
Fig. 11b), possibly via the BSE-stalk interface (Fig. 3a). Interestingly, 
in molecular dynamics simulations of two stalk dimers, the bent 
conformation of interface-1 rapidly converts towards a relaxed state, 
concomitant with an opening of the helical oligomer (Fig. 4b and 
Supplementary Fig. 12). This supports our assumption that constraints 
from G domain dimerization are required for the stabilization 
of the constricted state. 

Accordingly, dynamin initially assembles via the stalks with 
interface-1 in a relaxed conformation, allowing the filament to adopt 
a range of different diameters (state I in Fig. 4b and Supplementary 
Fig. 12). When the filament has embraced its template, GTP-loaded G 
domains of adjacent turns dimerize, the inhibitory stalk-BSE 
interface (Fig. 3a) is disrupted and the bent conformation of interface-1 is 
induced. This will result in constriction of the filament if the lipid 
template is flexible (state IV in Fig. 4b, constrictase model) or 
compaction of the dynamin helix if the lipid template is rigid (state II in 
Fig. 4b, poppase model). Constriction of a long dynamin helix will 
induce a sliding of neighbouring filaments until a new constricted 
equilibrium position of the oligomer is reached. This sliding is 
observed as a rotary movement of the dynamin helix upon addition 
of GTP in real-time imaging assays (twistase model). To reach the 
constricted state along the whole assembly, several cycles of local 
release and rebinding of neighbouring dynamin turns might be 
triggered by GTP-dependent dimerization of G domains and dissociation 
after GTP hydrolysis (state III in Fig. 4b). Accordingly, GTP binding 
and hydrolysis are both required for the mechanochemical function of 
dynamin and might induce local opening or twisting of the 
constricted dynamin helix. The resulting shear forces could tear the 
underlying membrane. 


The complex domain interplay in dynamin rationalizes the high 
degree of structural conservation in the dynamin superfamily, sheds 
light on the molecular mechanisms of disease-associated mutations 
and highlights structural features of the nucleotide-free state as a 
prerequisite to understand dynamin’s mechanochemical function. 


METHODS SUMMARY 


A human dynamin 1 construct (residues 6-746) containing the 
IHGIR395-399AAAAA and K562E mutations was expressed as a His₆-tag fusion in 
Escherichia coli and purified to homogeneity. Crystals were obtained using 
PEG400 and isopropanol as precipitants. The structure was solved by molecular 
replacement. To verify the model, the positions of 19 out of 26 methionines were 
determined by an anomalous data set of SeMet-substituted crystals. Liposome 
binding assays were carried out as described (http://www.endocytosis.org). 
GTPase assays were carried out at 37 °C using 20 mM HEPES/NaOH, pH 7.5, 
150 mM NaCl, 2 mM KCl, 2 mM MgCl₂ as reaction buffer. Uptake of 
fluorescently labelled transferrin was monitored in HeLa cells depleted of endogenous 
dynamin 2 by siRNA and transfected with the indicated siRNA-resistant 
dynamin 2 constructs. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 


Protein expression and purification. Human dynamin 1 (residues 6-746) and 
indicated mutants of this construct were expressed from pET46-EK/LIC 
(Novagen) as N-terminal His₆-tag fusion followed by a PreScission cleavage site. 
The crystallized construct contained mutations IHGIR395-399AAAAA which 
prevented oligomerization and K562E which reduced DNA contamination 
during purification. Proteins were expressed in Escherichia coli host strain 
Rosetta2-BL21-DE3, and bacteria were cultured in autoinduction medium (Novagen) for 
7.5 h at 37 °C followed by temperature shift to 20 °C for overnight expression. 
Selenomethionine (SeMet)-substituted human dynamin 1 was expressed in M9 
minimal medium, supplemented with L-amino acids Lys, Phe, Thr (100 mg l⁻¹); 
Ile, Leu, Val, SeMet (50 mg l⁻¹), using the same vector and host strain as for native 
protein expression. Cells were resuspended in buffer A (25 mM HEPES/NaOH 
(pH 7.9), 500 mM NaCl, 2 mM MgCl₂, 1 µM DNase (Roche), 500 µM Pefabloc 
(Roth)) and disrupted by a microfluidizer (Microfluidics). Cleared lysates 
(95,000g, 1 h, 4 °C) were incubated with Benzonase (Novagen) for 30 min at 
4 °C before application to a Co²⁺-Talon column (Clontech). Protein was eluted 
with buffer A containing additional 100 mM imidazole. Fractions containing 
human dynamin 1 were incubated with 2.4 mM β-mercaptoethanol and 
His₆-tagged PreScission protease overnight at 4 °C. Using 50 kDa molecular weight 
cut-off concentrators (Amicon), imidazole, β-mercaptoethanol and the free 
His-tag were removed by washing with buffer A, before a second application to a 
Co²⁺-Talon column to remove non-cleaved His-tagged dynamin 1 and protease. 
The flow-through and four column volumes of washing buffer A were collected 
and concentrated. Finally, dynamin 1 was purified by size exclusion 
chromatography on a Superdex200 column (GE) in buffer containing 25 mM HEPES/ 
NaOH (pH 7.5), 300 mM NaCl, 2 mM MgCl₂. Fractions containing dynamin 1 
were pooled, concentrated and flash-frozen in liquid nitrogen (Supplementary 
Fig. 1). SeMet-substituted protein was purified using the same protocol. 
Crystallization and structure determination. Crystallization trials by the 
sitting-drop vapour-diffusion method were performed at 4 °C using a 
Hydra-plus-One pipetting robot (Matrix Technologies Corporation) and Rock Imager 
storage system (Formulatrix). The human dynamin 1 construct (300 nl at a 
concentration of 12 mg ml⁻¹) was mixed with an equal volume of reservoir 
solution containing 9% PEG400, 6% isopropanol, 100 mM HEPES/NaOH 
(pH 7.3), 10 mM MgCl₂ and 10 mM KCl. Crystals of the native protein appeared 
after 2 weeks and had dimensions of 0.2 mm × 0.05 mm × 0.02 mm. Crystals of 
SeMet protein were obtained in 6% MPD, 10% isopropanol, 0.1 M HEPES/NaOH 
(pH 7.5). During flash-cooling of the crystals in liquid nitrogen, a cryo-solution 
containing additionally 20% ethylene glycol was used. All data were recorded at 
BL14.1 at BESSY II, Berlin. One native data set was collected from a single crystal 
and processed and scaled using the XDS program suite. Phase information was 
obtained by molecular replacement with Phaser using the structure of the 
nucleotide-free rat dynamin 1 G domain, the stalk of human MxA and the 
human PH domain as search models. The model was built using COOT and 
iteratively refined using CNS1.3 with a deformable elastic network, followed by 
10 cycles of TLS refinement using 1 TLS group per domain in refmac5 (ref. 45). To 
confirm the amino acid sequence, a data set of a SeMet-substituted protein crystal 
was recorded at the selenium peak wavelength. Owing to non-isomorphism of the 
native and SeMet crystals, molecular replacement with the refined native 
structure was performed. The positions of 19 out of 26 selenium atoms in the dynamin 
1 construct were determined by calculating an anomalous difference map with the 
CCP4 program suite using the calculated phases after refinement. The final 
native model comprises amino acids 6-746. Residues 63-64, 110-112, 
143-149, 347-356, 394-404, 446-447, 500-517, 534-537, 578-581 and 632-652 are 
disordered. Electron density for loop L2^BS (amino acids 709-715) was only 
visible in the SeMet derivative structure and was therefore not modelled in the 
native structure. However, L2^BS was included for figure preparation. Of all 
residues, 86.9% are in the most favoured regions of the Ramachandran plot and no 
residue is in the disallowed regions, as analysed with Procheck. Figures were 
prepared with PyMol. Buried surface areas (per molecule) were calculated using 
CNS. Solvent-accessible areas per residue were calculated using areaimol. 
Domain superpositions were performed with lsqkab. Sequences were aligned 
using CLUSTAL W and adjusted by hand. 

Liposome co-sedimentation assays. Liposomes were prepared as previously 
described (http://www.endocytosis.org). Folch liposomes (total bovine brain 
lipids fraction I from Sigma) in 20 mM HEPES/NaOH (pH 7.5), 150 mM NaCl, 
2 mM KCl were extruded through a 0.1 µm filter. Liposomes (0.1 mg ml⁻¹) were 
incubated at room temperature with 4.0 µM of the indicated dynamin 1 construct 
for 10 min in 40 µl reaction volume, followed by a 70,000g spin for 10 min at 
20 °C. 

GTP hydrolysis assay. GTPase activities of 1 µM of the indicated dynamin 1 
constructs were determined at 37 °C in 20 mM HEPES/NaOH (pH 7.5), 150 mM 
NaCl, 2 mM KCl, 2 mM MgCl₂, in the absence and presence of 0.1 mg ml⁻¹ 
0.1-µm-filtered Folch liposomes, using saturating concentrations of GTP as 
substrate (1 mM for the basal and 3 mM for the stimulated reactions). Reactions were 
initiated by the addition of protein to the reaction. At different time points, 
reaction aliquots were diluted 15-fold in GTPase buffer and quickly transferred 
to liquid nitrogen. Nucleotides in the samples were separated via a reversed-phase 
Hypersil ODS-2 C18 column (250 × 4 mm), with 10 mM tetrabutylammonium 
bromide, 100 mM potassium phosphate (pH 6.5), 7.5% acetonitrile as running 
buffer. Denatured proteins were adsorbed at a C18 guard column. Nucleotides 
were detected by absorption at 254 nm and quantified by integration of the 
corresponding peaks. Rates were derived from a linear fit to the initial reaction 
(<40% GTP hydrolysed). 
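
The rate derivation described above (a linear fit restricted to the initial part of the reaction) can be sketched as follows. This is a minimal illustration, not the authors' analysis script: the function name, the input layout and the example numbers are hypothetical, and an ordinary least-squares fit is assumed.

```python
def initial_gtpase_rate(times_min, fraction_hydrolysed, gtp_total_uM,
                        protein_uM, max_fraction=0.4):
    """Turnover rate (per minute) from a linear fit to the initial reaction.

    Only time points at which less than max_fraction of the GTP has been
    hydrolysed are used, mirroring the <40% criterion in the text.
    """
    points = [(t, f * gtp_total_uM)
              for t, f in zip(times_min, fraction_hydrolysed)
              if f < max_fraction]
    if len(points) < 2:
        raise ValueError("need at least two time points in the initial phase")

    mean_t = sum(t for t, _ in points) / len(points)
    mean_p = sum(p for _, p in points) / len(points)
    s_xy = sum((t - mean_t) * (p - mean_p) for t, p in points)
    s_xx = sum((t - mean_t) ** 2 for t, _ in points)
    slope_uM_per_min = s_xy / s_xx          # product formed per minute
    return slope_uM_per_min / protein_uM    # normalise to enzyme concentration


if __name__ == "__main__":
    # Hypothetical time course: 1 mM GTP, 1 uM protein; the last point is
    # above the 40% cut-off and is therefore ignored by the fit.
    rate = initial_gtpase_rate([0, 1, 2, 3, 4], [0.0, 0.1, 0.2, 0.3, 0.6],
                               gtp_total_uM=1000.0, protein_uM=1.0)
    print(f"rate = {rate:.1f} per min")
```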

Transferrin uptake in HeLa cells. HeLa cells were transfected with siRNA using 
Oligofectamine (Invitrogen) on day 1. The sequence of the siRNA against human 
dynamin 2 is 5′-GCAACUGACCAACCACAUC-3′, targeting nucleotides 
849-867. On day 2, cells were transfected with pEGFP-N1 (Clontech) or siRNA-resistant 
rat dynamin 2-pEGFP-N1 using Lipofectamine 2000 (Invitrogen) and seeded on 
coverslips. On day 3, cells were serum-starved and incubated with 20 µg ml⁻¹ 
transferrin-Alexa568 (Molecular Probes, Invitrogen) for 10 min at 37 °C. After four 
PBS washes on ice, cells were paraformaldehyde-fixed for 20 min at room 
temperature. Transferrin uptake was analysed using a Zeiss Axiovert200M microscope and 
Slidebook imaging software (3i Inc.). Internalized transferrin was quantified from 
transfected cells only and normalized to the value of eGFP-transfected, 
mock-treated cells (n = 28-83 images, five independent experiments; 
IHGVR395-399AAAAA, I10D, QD17-18AA: three independent experiments; E368K, R369W, 
S619L, F372D: two independent experiments). Knockdown of dynamin 2 and 
expression levels of dynamin 2-eGFP mutants were assessed by immunoblotting 
using antibodies to dynamin 2 (a gift of M. A. McNiven), β-actin (clone ac-15, 
Sigma-Aldrich) and eGFP (clone jl-8, Clontech). 

Localization of dynamin 2-eGFP mutants. HeLa cells were transfected with 
dynamin 2-pEGFP-N1 wild-type or mutant constructs 20 h before fixation in 4% 
paraformaldehyde for 12 min at room temperature. Cells were blocked and 
permeabilized in 10% goat serum, 0.3% Triton X-100, 100 mM NaCl in phosphate 
buffer and stained for endogenous adaptor complex AP-2 using an antibody to 
α-adaptin (clone AP-6, Abcam). Total internal reflection fluorescence (TIRF) 
imaging was performed using a Zeiss Axiovert200M microscope equipped with 
a ×100 TIRF objective and a dual-colour TIRF setup from Visitron Systems using 
Slidebook imaging software (3i Inc.). 

Loop modelling and molecular dynamics simulations. For modelling of the 
unresolved loop regions L1N^S and L2^S, two stalk dimers in the constricted state 
served as scaffold. Using Modeller (9v8), the scaffold was fixed in position, 
whereas L1N^S and L2^S could freely sample the empirical potential function. To 
reduce the conformational search space, additional harmonic distance restraints 
were added between conserved residues Arg 399-Asp 406 and Glu 355-Arg 361. 
Based on the modelled stalk tetramer, five independent all-atom molecular 
dynamics simulations (NVT ensemble), each of 90 ns, were conducted at 
T = 300 K in a periodic boundary setting using Gromacs 4.5.3 (ref. 51). The 
model was immersed in a rectangular 20 nm × 10 nm × 9 nm box, containing 
approximately 56,400 water molecules, 21 sodium and 17 chloride ions to 
neutralize the system, resulting in a total number of 185,857 atoms. As force fields, 
Amber99 (protein and ions) and TIP3P (water) were applied. To treat 
long-range interactions, the Particle-mesh Ewald method was used. A cut-off of 
1 nm was used for the real parts of electrostatic and van der Waals interactions. 
All hydrogen bonds were constrained by using the LINCS algorithm, allowing 
for an integration time-step of 2 fs. For the thermostatted integration, Langevin 
dynamics were used as implemented by the Gromacs s.d. integrator (tau_t = 1). 

For the calculation of bending and twisting angles, each of the four stalk 
monomers was represented by two geometric centres, defined as the mean position of Cα 
atoms of residues 366-377, 420-430, 468-481 and 671-683 for position A and 
residues 360-365, 428-445, 457-472 and 686-701 for position B (Supplementary 
Fig. 12d). The stalk bending angle α was defined as the mean angle between parallel 
stalks, and the twisting angle β by the minimal angle between the planes spanned 
by each dimer (positions A, B, B′ in Supplementary Fig. 12f). 
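
The geometric definitions above can be sketched in a few lines. This is an illustrative reading of the text, not the authors' analysis code: all function names are hypothetical, the stalk axis is assumed to be the vector from centre A to centre B, and which stalks count as "parallel" is left to the caller.

```python
import math


def centroid(points):
    """Geometric centre (mean position) of a list of (x, y, z) coordinates."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(3))


def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def angle_deg(u, v):
    """Angle between two vectors in degrees."""
    cos_t = _dot(u, v) / (math.sqrt(_dot(u, u)) * math.sqrt(_dot(v, v)))
    return math.degrees(math.acos(max(-1.0, min(1.0, cos_t))))


def bending_angle_deg(stalk_pairs):
    """Mean angle between the A->B axes of each pair of parallel stalks.

    stalk_pairs is a list of ((A1, B1), (A2, B2)) tuples of centres.
    """
    angles = [angle_deg(_sub(b1, a1), _sub(b2, a2))
              for (a1, b1), (a2, b2) in stalk_pairs]
    return sum(angles) / len(angles)


def twisting_angle_deg(plane1, plane2):
    """Minimal angle between two planes, each given by three centres (A, B, B')."""
    n1 = _cross(_sub(plane1[1], plane1[0]), _sub(plane1[2], plane1[0]))
    n2 = _cross(_sub(plane2[1], plane2[0]), _sub(plane2[2], plane2[0]))
    theta = angle_deg(n1, n2)
    return min(theta, 180.0 - theta)
```

In use, `centroid` would be applied to the Cα coordinates of the listed residue ranges for each monomer in every trajectory frame, and the two angle functions evaluated on the resulting centres.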

For each simulation time step, the corresponding stalk tetramer structure
describes a linear transformation of the first dimer onto the second dimer,
consisting of a translation vector and a rotation matrix. This linear transformation was
used to reconstruct the structure of an ideal dynamin helix by applying it to the
dynamin dimer model in the constricted state. The diameter, d, and the rise per
turn, r, of these helices were measured by using the geometric centres of the stalk
coordinates, yielding trajectories in (d,r). Based on these trajectories, the free
energy surface of stalk helix conformations was calculated: the two-dimensional
space (d,r) was discretized into boxes of size 25 × 25 Å. Based on the simulation
trajectories, the transition probability between all pairs of boxes was computed,
which allowed the calculation of an equilibrium probability of finding a single stalk
tetramer in a given box, $p_1(d,r)$. When more than two stalk dimers are assembled,
non-cooperative behaviour of neighbouring dimers has to be considered; for
example, the assembled stalk dimers can almost independently switch between
different conformations. The resulting equilibrium distribution of two independent
tetrameric units would be given by the convolution of two single-tetramer
distributions, $p_2(d,r)$. It was found that after only about three such convolutions,
the resulting probability distribution converges, $p_3(d,r) \approx p(d,r)$. Thus, assuming
that the helix has at least three independently switching subunits, the free energy
landscape is unique, and is given by $F(d,r) = -kT\ln p(d,r)$, where k is the
Boltzmann constant and T the temperature.
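The single-tetramer step of this procedure (transition probabilities between boxes, equilibrium probability, free energy) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the boxes are flattened to integer labels, the toy trajectory is invented, the equilibrium distribution is obtained by simple power iteration, and the function names are hypothetical. The convolution step for several independent units is not shown.

```python
import math

def transition_matrix(traj, n_states):
    """Row-normalized transition probabilities between discretized boxes."""
    counts = [[0.0] * n_states for _ in range(n_states)]
    for i, j in zip(traj[:-1], traj[1:]):
        counts[i][j] += 1.0
    matrix = []
    for row in counts:
        total = sum(row)
        # Boxes with no observed outgoing transition keep an all-zero row.
        matrix.append([c / total for c in row] if total > 0 else row)
    return matrix

def stationary_distribution(matrix, n_iter=10000, tol=1e-12):
    """Equilibrium box probabilities by power iteration (p <- p T)."""
    n = len(matrix)
    p = [1.0 / n] * n
    for _ in range(n_iter):
        q = [sum(p[i] * matrix[i][j] for i in range(n)) for j in range(n)]
        total = sum(q)
        q = [x / total for x in q]  # renormalize after each step
        if max(abs(a - b) for a, b in zip(p, q)) < tol:
            return q
        p = q
    return p

def free_energy(p, kT=1.0):
    """F = -kT ln p, shifted so that the most populated box has F = 0."""
    f = [-kT * math.log(x) if x > 0 else math.inf for x in p]
    f_min = min(f)
    return [x - f_min for x in f]

# Toy trajectory over three boxes (hypothetical data).
traj = [0, 0, 1, 1, 1, 2, 1, 1, 0, 0, 1, 2, 2, 1, 1, 0]
T = transition_matrix(traj, 3)
p = stationary_distribution(T)
F = free_energy(p)
print([round(x, 3) for x in p])
print([round(x, 3) for x in F])
```

Free energies are reported in units of kT by default; passing a value of kT in kJ/mol or kcal/mol converts them to physical units.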
The crystal structure of dynamin


Marijn G. J. Ford¹, Simon Jenni² & Jodi Nunnari¹


Dynamin-related proteins (DRPs) are multi-domain GTPases that function via oligomerization and GTP-dependent 
conformational changes to play central roles in regulating membrane structure across phylogenetic kingdoms. How 
DRPs harness self-assembly and GTP-dependent conformational changes to remodel membranes is not understood. 
Here we present the crystal structure of an assembly-deficient mammalian endocytic DRP, dynamin 1, lacking the 
proline-rich domain, in its nucleotide-free state. The dynamin 1 monomer is an extended structure with the GTPase 
domain and bundle signalling element positioned on top of a long helical stalk with the pleckstrin homology domain 
flexibly attached on its opposing end. Dynamin 1 dimer and higher order dimer multimers form via interfaces located in 
the stalk. Analysis of these interfaces provides insight into DRP family member specificity and regulation and provides a 
framework for understanding the biogenesis of higher order DRP structures and the mechanism of DRP-mediated
membrane scission events.


Dynamin-related proteins (DRPs) belong to a highly conserved (Supplementary
Fig. 1) GTPase superfamily that catalyses diverse membrane
remodelling events. Membrane scission DRPs include dynamin
1, which catalyses clathrin-coated vesicle scission at the plasma membrane,
and Drp1/Dnm1, which divide mitochondria. Despite their
functional diversity, all DRPs undergo GTP cycle-dependent conformational
changes to regulate self-assembly and disassembly.
DRP architecture includes an amino-terminal GTPase domain, a
bundle signalling element (BSE), a middle domain (MD) and a
GTPase effector domain (GED). Many DRPs also have a variable
region between the MD and GED; in dynamin 1, this is a pleckstrin
homology domain (PH) that binds to phosphatidylinositol-4,5-bisphosphate
(PtdIns-4,5-P2)-containing membranes to facilitate targeting
and possibly membrane remodelling via membrane insertion.
Assembled DRPs can form helical structures in vitro. Within these
structures, GTP cycle-driven conformational changes result in membrane
remodelling. The structural basis for DRP self-assembly and
GTP cycle-dependent conformational changes is not fully understood.
The mechanism of assembly has been informed by the structure of the
‘stalk’ of MxA, a distantly related DRP, in which the MD and part of the
GED form an extended helical bundle that mediates self-assembly via
conserved interfaces. Several structures have also provided insight into
DRP GTP cycle conformational changes (Supplementary Fig. 2).
Cryo-electron microscopic structures of assembled dynamin in
guanosine-5′-[(β,γ)-methyleno]triphosphate (GMPPCP)-bound and
nucleotide-free states have provided models for the assembled oligomers and
the location of the GTPase and PH domains within the helical
structures. However, to understand the basis of DRP self-assembly and
mechanism, the architecture of DRP domains within a single molecule
must be elucidated. Here, we report the crystal structure of an
assembly-deficient dynamin 1 in the nucleotide-free state that lacks only the
unstructured carboxy-terminal proline-rich domain (PRD).


G397D, an assembly-deficient dynamin 1 mutant 


The propensity of DRPs to assemble has been an obstacle to obtaining
crystals suitable for diffraction experiments. We identified the mutation
G436D, at an invariant MD residue, in the Saccharomyces cerevisiae
mitochondrial division DRP Dnm1, by screening for mutations that
possessed the same phenotype as assembly-deficient Dnm1 G385D,
specifically mutations that shifted Dnm1-green fluorescent protein
(GFP) fusion from a punctate to a more diffuse localization pattern in
yeast cells. We expressed and purified (Supplementary Fig. 3a) the
orthologous rat dynamin 1 mutant protein lacking the PRD (Dyn1
G397D ΔPRD) and examined its ability to self-assemble using light
scattering (Fig. 1a). Addition of GMPPCP caused an increase in scattering
in Dyn1 ΔPRD samples, but caused no change in scattering in Dyn1
G397D ΔPRD samples. Using a combination of size-exclusion chromatography
and sucrose gradient centrifugation, Dyn1 ΔPRD and
Dyn1 G397D ΔPRD were estimated to be dimeric (Supplementary
Fig. 3b and Supplementary Table 1) under non-assembly conditions,
similar to other assembly-deficient DRP mutants. Under assembly
conditions, we observed an increase in the sedimentation coefficient of
Dyn1 ΔPRD, but not for Dyn1 G397D ΔPRD (Supplementary Table 1).
Dyn1 G397D ΔPRD was defective for assembly-stimulated GTP hydrolysis
(Fig. 1b and Supplementary Table 2) and failed to assemble into
helical structures on phosphatidylinositol 4-phosphate (PtdIns-4P)-containing
lipid nanotubes, in contrast to Dyn1 ΔPRD (Fig. 1c).
Dyn1 G397D ΔPRD was also soluble at higher concentrations than
Dyn1 ΔPRD. Together, these data indicate that the G397D mutation
severely hampers dynamin self-assembly and substantiate the critical
role of the MD in intermolecular interactions. Given these
characteristics, Dyn1 G397D ΔPRD presented an attractive target for crystallization.


Crystallization and structure of Dyn1 G397D ΔPRD


We obtained orthorhombic crystals of Dyn1 G397D ΔPRD that diffracted
to a minimum Bragg spacing of 3.1 Å (Methods and
Supplementary Table 3). The structure was solved by molecular
replacement, using the nucleotide-free rat dynamin 1 GTPase
domain, the human dynamin 1 PH and a portion of the human
MxA stalk, as sequential search models (Methods). We traced
the complete model, except for some disordered loops, and assigned
amino acids after refinement of the molecular replacement solution.
The model was refined to R/R_free values of 0.21/0.27. A representative
example of the B-sharpened likelihood-weighted 2mF_obs − DF_calc
electron density map is shown in Fig. 2c.

Dyn1 G397D ΔPRD forms an extended structure with the GTPase
and PH domains separated by a stalk consisting of an anti-parallel
helical bundle of the MD and a helix from the GED (Fig. 2a, b). By
Figure 1 | The G397D mutation in Dyn1 ΔPRD blocks self-assembly. a, 90°
light scattering. Dyn1 ΔPRD (blue, 1 µM) and Dyn1 G397D ΔPRD (red, 1 µM)
were monitored after addition of 0.5 mM GMPPCP and, in the case of Dyn1
ΔPRD, 1 mM GTP (blue arrows). Red arrow indicates opening of sample port.
AU, arbitrary units. b, Steady-state GTP hydrolysis kinetics of Dyn1 ΔPRD and
Dyn1 G397D ΔPRD in the absence (light blue and red) and presence (dark blue
and orange) of 0.1 mg ml−1 liposomes containing 10% PtdIns-4P, monitored
by a NADH-dependent coupled assay as described. A representative trace is
shown with 1 mM GTP. c, Transmission electron microscopy of negative-stained
0.25 mg ml−1 10% PtdIns-4P lipid nanotubes with Dyn1 ΔPRD
(middle) and Dyn1 G397D ΔPRD (right) and no added protein (left) and
0.5 mM GMPPCP. Scale bars, 200 nm.


contrast, the distantly related bacterial dynamin-like protein (BDLP)
is compactly folded in its nucleotide-free and GDP-bound states (Supplementary
Fig. 2a), with its ‘paddle’, in an analogous region to the PH
domain of dynamin, in close proximity to the GTPase domain. Thus,
unlike BDLP, GTP binding in dynamin is not harnessed to form an
extended structure, consistent with nucleotide-independent assembly of
dynamin on liposomes. Linkers connecting the MD to the PH and
the PH to the GED are disordered, indicating a flexibly tethered PH.
Thus, any of three crystallographic symmetry-related PH domains
could connect to the remainder of the structure (Fig. 2d). The probable
PH partner, based on the fit with the envelope of an assembly-deficient
dynamin dimer determined by small-angle X-ray scattering (SAXS), is
shown in Fig. 2b. The structure of the PH is similar to those previously
determined, with expected differences concentrated in the variable
loops.

The structure of the GTPase domain is similar to that of the previously
determined nucleotide-free dynamin GTPase domain structure.
There are minor expected changes in the poorly resolved
switch 2 region, the dynamin-specific loop and in the loop connecting
the N_GTPase to the GTPase domain. The C_GTPase helix is kinked at the
conserved proline 294 and, together with the N_GTPase helix and a helix
at the C terminus of the GED (C_GED), forms the three-helix BSE.
C_GED covers a groove of 937 Å² between the N_GTPase and C_GTPase
helices. In the nucleotide-free dynamin 1 GTPase and the GDP-bound
Dictyostelium discoideum dynamin A GTPase structures, a
myosin peptide substitutes for C_GED, indicating the importance of
this interface for GTPase domain stability.

A linker with elevated crystallographic temperature factors connects
the C_GTPase to the dynamin stalk (Fig. 2a, b) and contains two
prolines (319 and 322), whose equivalents in the distantly related DRP
atlastin 1 connect the GTPase and MD (Supplementary Fig. 2b).


562 | NATURE | VOL 477 | 29 SEPTEMBER 2011 


[Figure 2a labels: domain-ending residues 23, 274, 317, 496, 628, 743, 864; domains GTPase, Middle, PH, GED, PRD; helices N_GTPase and C_GTPase.]

Figure 2 | The crystal structure of Dyn1 G397D ΔPRD. a, Schematic of the
Dyn1 domain structure. Numbers indicate domain-ending amino acid. Colour
scheme used here is retained. b, Crystal structure of Dyn1 G397D ΔPRD.
Linkers between the N_GTPase and GTPase domain and the C_GTPase and MD are
shown in grey. Loops with no density are represented with dashed lines. Stalk
nomenclature is based on that of the MxA stalk structure. VL, variable loop.
c, An example of the refined B-sharpened 2mF_obs − DF_calc map, contoured at
1σ. The region shown is part of the stalk boxed by purple dotted lines in
(b). d, Three symmetry-related PH domains in the lattice.


Although the dynamin stalk sequence shares limited identity with
MxA, its overall structure is similar and the MxA nomenclature is
retained (Supplementary Fig. 4a). As for MxA, helix α1 is split by a
disordered loop, L1. The remainder of α1 diverges from MxA and is
split into two helices, termed α1C1 and α1C2. Helix α1C2 connects to
helix α2 via a short disordered loop, L2. Helices α2 and α3 run the
length of the stalk and are joined by a short loop, L3. The stalk is
completed by GED-derived helix α4 that spans the stalk and connects
to C_GED via a linker. Following helix α3 is a coil that folds across α4
and is strongly conserved in dynamins, Drp1s and Dnm1, but absent
in MxA.


The dynamin-dimer interface 
The crystal lattice contains linear filaments of dynamin assembled via
three stalk interfaces, similar to MxA, resulting in layers of interacting
stalks separated by GTPase and PH domains (Fig. 3a, b). Interface
2, the largest with a buried surface of 1,339 Å², has two-fold symmetry
and is formed by residues from stalk helices α4 and α3 (Fig. 3c), with
an additional residue from α1C1 (H367). Each protomer in the interface
contributes seven direct hydrogen bonds and eight hydrophobic
residues line the site of contact between α3 and α4 (Fig. 3c).
Interface 2 sequence conservation indicates a mechanism for dimer
specificity within the DRP superfamily. Phylogenetic analysis of the
hydrogen bonding partners within this region in dynamins, Drp1s
and Dnm1, and Mx proteins allowed us to categorize three classes of
residues: conserved in dynamins, Drp1/Dnm1, and Mx proteins (in
Figure 3 | Dyn1 G397D ΔPRD stalk interfaces mediate self-assembly.
a, Schematic diagram of four monomers showing interfaces in the crystal lattice.
b, Surface representation showing the locations of interfaces 1 and 2. c, Detail of
interface 2. Protomers are shown in lighter and darker hues. Green dotted lines
are hydrogen bonds. d, Conservation of interface 2 residues involved in
hydrogen bonding in dynamins, Drp1/Dnm1 and Mx proteins. Blue, conserved
in dynamins, Drp1/Dnm1 and Mx proteins; yellow, conserved in dynamins and


blue), conserved only within dynamins and Drp1/Dnm1 (in yellow),
and conserved only in the dynamins (in red) (Fig. 3d). Plotting these
classes onto a surface representation of the dynamin monomer
(Fig. 3e) indicates that dimerization specificity is controlled by a
spatial combinatorial code. As predicted from this code, hetero-oligomers
consisting of dynamin 1 and dynamin 2 are observed.
Several residues contributing to the apparent specificity are localized
in the strongly conserved coil of dynamin, Drp1 and Dnm1 that
follows α3.


Dynamin stalk interfaces drive self assembly 

Interfaces 1 and 3 are also stalk-localized and mediate higher order
assembly of dynamin dimers (Fig. 3a). Interface 1 is at the tips of
interacting stalks, proximal to the GTPase domain and BSE (Fig. 3f)
and is formed through interactions between protomer helices α4
and α1. The interface is capped by a stacking interaction between
Drps; red, conserved in dynamins. Alignment shows a subset of sequences used
to determine the conservation. Sequences are identical to Supplementary Fig. 1,
with the addition of Rattus norvegicus Mx1 (P18588.1) and Homo sapiens Mx1
(P20591.4). Fruitfly, Drosophila melanogaster; human, Homo sapiens; rat,
Rattus norvegicus; worm, Caenorhabditis elegans; yeast, Saccharomyces
cerevisiae. e, Surface representation of conservation data shown in
(d). f, Interface 1. Density interpreted as a PEG400 molecule is shown in black.


opposing and flexible Y706 residues, conserved in dynamins and
Drp1s. Interface 1 includes four hydrogen bonds (Fig. 3f, dashed
green lines), with the remainder consisting of hydrophobic residues.
The buried surface area of interface 1 is relatively small (624 Å²) and is
likely to be dynamic to tolerate protomer interactions in a range of
orientations.

Interface 3 is at the distal end of the stalk, where L2 loops from
symmetry-related molecules are in close proximity (Supplementary
Fig. 4b). The G397D mutation is within L2, which could not be traced
due to poor density. The N-terminal region of L2 is also in close
proximity to a symmetry-related L1, which features a highly conserved
glycine, G346, that when mutated in Dnm1 (G385D) blocks
assembly beyond the dimer. We therefore predict that G397 and
G346 are near one another in assembled dynamin. In addition, mutations
at two dynamin arginine residues, R399 (in L2) and R361 (in
α1C1, following L1), and the corresponding interface 3 mutations in
MxA also inhibit assembly and stabilize dimeric forms. In contrast,
disruption of MxA interface 1 yields a mixture of dimers and
tetramers. Thus, our data indicate that interface 3 is critical at an
early step in the dynamin assembly pathway as its disruption stabilizes
dimeric dynamin and allows linear filament formation. In contrast,
interface 1 interactions probably function only to stabilize oligomerization.

From our structure, we propose that the dynamin dimer interface 2
is constitutive and relatively rigid. Compared with the linear arrangement
in the crystal, we propose that in helical assemblies the necessary
rotational and translational shifts occur between adjacent dimers at
interfaces 1 and 3 (Fig. 4a). Indeed, stalk dimers fitted into the GMPPCP-bound
helical electron microscopic reconstruction of dynamin
(Fig. 4b) possess a more tightly packed interface 3 and interface 1.
These differences can be attributed to the disordered interface 3 in the
crystal that is probably due to steric hindrance from the G397D
mutation, and the dynamic nature of the hydrophobic interface 1.


Regulation of DRPs 


The dynamin PH is essential for endocytosis and interacts with inositol
phospholipids with low affinity. Centronuclear myopathy (CNM)
disease mutations cluster at the C terminus of the PH α-helix (Supplementary
Fig. 5a), underscoring its importance. They cause an
increase in basal GTPase activity, without altering interactions with
inositol phospholipids. In addition, SAXS analysis of an assembly-deficient
dynamin indicates that the CNM mutants have a different
conformation compared to wild type. In our lattice, three PH domains
related by crystallographic symmetry lie close to interface 3 and the L1
loop (Fig. 2d), suggesting an interaction. Thus, the PH may serve to
regulate access to this key multimerization interface to couple dynamin
membrane interactions to dynamin assembly. Phosphoinositide binding
by the PH variable loops and penetration of the membrane by variable
loop 1 could help to expose dynamin interface 3 and/or L1 and thus
promote multimerization.

Alignment of the dynamin PH and corresponding sequences from
Drp1s shows conservation of key residues (Supplementary Figs 1 and
5b), indicating that lipids may similarly regulate mammalian mitochondrial
division. However, mammalian Drp1s lack most of the
C-terminal PH α-helix, including residues mutated in CNM. The
details of Drp1 interface 3 regulation may therefore be different.
Consistently, Drp1 splice variants have deletions in this region, pointing
to a potential regulatory role. By contrast, Danio rerio Drp1 and
fungal Dnm1 have an insert B of unknown structure in this region.
Mitochondrial division DRPs can therefore be subdivided on this
basis, which correlates with the divergence of their corresponding
effectors, yeast Mdv1 and mammalian Mff.


Discussion 


Our structure provides insight into the mechanism and regulation
of dynamin assembly and into how the dynamin GTPase cycle is
harnessed for function. Several observations point to a key role of
the dynamin BSE in the formation of the GTPase-GTPase dimer
interface observed in the GTPase-C_GED GDP·AlF4− crystal structure.
This interface is likely to form from adjacent rungs of a dynamin
helix and is critical to dynamin function as it mediates assembly-stimulated
GTP hydrolysis. Comparison of the GTPase-GTPase
dimer with our nucleotide-free structure indicates that in addition to
expected differences, the BSE is flexible. Dynamin genetic data also
support a role for the BSE in the regulation of GTPase-GTPase interface
formation. Specifically, the dynamin switch 2 shibire ts2 mutation,
G146S, which causes endocytic intermediates with ‘collared’ dynamin
necks to accumulate, is suppressed by the sushi mutation A738T,
located in the C_GED peptide, facing the hydrophobic groove (Fig. 5a).
Insight into how dynamin undergoes conformational changes also
comes from distantly related DRP structures (Supplementary Fig. 2).
GTP-dependent GTPase domain dimerization is also observed for
guanylate binding protein, indicating that this may be a common
feature of the DRP superfamily (Supplementary Fig. 2c). In addition,
two recent structures of the atlastin 1 cytosolic domain, thought to
represent pre- and post-endoplasmic reticulum membrane fusion
conformations, indicate that large changes occur in the position of the
3-helix bundle ‘middle domain’ relative to the GTPase domain
(Supplementary Fig. 2b).

We propose that GTP binding and self-assembly promote dynamin 
GTPase-GTPase dimer formation via an opening of the BSE relative 
to the GTPase domain. In support, a modified version of our structure 
can be fit into the cryo-electron microscopic reconstruction of 
GMPPCP-bound dynamin with the BSE in a substantially more open 
Figure 4 | Oligomerization of dynamin into helical structures. a, Dynamin
helices derived from the linear arrangement in our crystal structure. Two stalk
dimers (green and magenta) that engage in interface 1 and 3 are related by
crystallographic translation. Experimentally determined helical parameters for
dynamin assembled into helices in the GMPPCP-bound state were matched
by applying a small shift and tilt of one stalk dimer with respect to the other.
b, Placement of oligomerized dynamin model into the electron microscopy
density map contoured at 1.2σ. In side view: the fit of the GTPase domain as a
GTPase-GTPase dimer with the BSE in open conformation to connect to
interface 1 of the stalk helix (solid density is contoured at 3.6σ). c, Observed
conformational flexibility of the BSE. Model fitted into the helical
reconstruction is shown as black superimposed ribbon on the crystal structure
of the GTPase-C_GED fusion dimer (PDB accession no. 2X2E).


©2011 Macmillan Publishers Limited. All rights reserved 


[Figure 5 panel labels: switch 2; shibire ts1 (G273D); shibire ts2 (G146S); GMPPCP; GDP·AlF4−; nucleotide free; PH domain; GTPase; stalk; BSE; GED; helix axis; open conformation; closed conformation; GTP hydrolysis; disassembly.]


Figure 5 | Model for dynamin GTP cycle conformational changes.
a, Mapping of dynamin shibire and sushi mutations. b, Nucleotide-dependent
dynamin conformations. The GTPase core domains (red) are in the same
orientation. Left, GTP-bound state with open BSE conformation of dynamin as
fitted into the GMPPCP-bound electron microscopic reconstruction shown in
Fig. 4. Right, transition state of dynamin obtained by superposition of the BSE
residues 291-312 and 727-743 of our structure on the corresponding residues
of the GDP·AlF4−-bound GTPase-C_GED fusion dimer (PDB accession no.
2X2E). Transition from open to closed BSE conformation results in
movement of stalk domains. c, Model for Dyn1 ΔPRD GTP-bound helix. The
BSE is opened to allow GTPase-GTPase dimer formation. d, GTP hydrolysis
closes the BSE, which adopts the conformation of the GDP·AlF4−-bound
transition state. This results in a substantial global constriction of the helical
oligomeric assembly causing membrane deformation and scission. e, Schematic
of how the proposed GTP hydrolysis triggered BSE conformational change is
transmitted to oligomerized stalk domains.


conformation (Fig. 4b). We obtained a good fit of the GTPase-GTPase
dimer domains and stalk interface 1 using constraints dictated
by the helical fit of the oligomeric stalk and the strong GTPase domain-derived
density (Fig. 4b). The BSE α-helices fit into the density stretching
from the GTPase domain to the oligomeric stalk, indicating that a
BSE flipping-out motion occurs at two hinge regions: the α-helical kink
at P294 and in the loop connecting the N_GTPase and the GTPase
domain, at P32 (Fig. 4c). Consistently, superposition of GDP·AlF4−
GTPase-C_GED fusion GTPase-GTPase dimer partners indicates a
smaller yet directionally equivalent opening of the BSE in one protomer.
This proposed conformational change is feasible as there are
relatively few contacts that hold the N_GTPase helix to the body of the
GTPase domain in our structure. In addition, the BSE links the GTPase
domain to the stalk and interface 1 via a short turn, making it well
placed to transmit conformational alterations. As previously noted,
comparison of the nucleotide-free and GMPPCP-bound electron
microscopic reconstructions indicates that the GMPPCP-bound helix
has a relatively smaller diameter. In addition, the stalk density is more
‘kinked’ in the GMPPCP-bound form. When we docked our structure,
we observed that the bases of the stalks do not fit the density of the
GMPPCP-bound helix. However, where the fit becomes poor there are
strongly conserved prolines in α2 and α4 and a partially conserved
proline in α1C2 clustered in the stalk. We propose that these proline
residues facilitate the formation of a kink at the stalk base in the
GMPPCP-bound helix, which might allow interface 1 to form more
fully in the assembled helix.

As predicted by our model, GTP hydrolysis would induce closure of
the BSE via a transition state represented by the structure of the
GTPase domain-BSE fusion in the presence of GDP·AlF4−, towards
the conformation observed in our nucleotide-free structure (Fig. 5b).
As GTP binding to dynamin is not rate limiting and GTP hydrolysis is
stimulated by the formation of the GTPase-GTPase interface between
adjacent rungs, it is likely that the BSE-dependent conformational
change occurs in the context of short dynamin helical assemblies rather
than within a helix consisting of many rungs. Short assemblies could
also result in an approximately temporally coordinated conformational
change (Fig. 5c-e). The conformational changes we propose would
cause disruption to the assembled helix and the underlying membrane
via local rung shifts. Interface 1 in our structure, which buries a relatively
small surface area in the assembled stalk lattice, will be especially
susceptible to change by inter-rung GTPase-GTPase dimer formation
and its subsequent disruption by GTP hydrolysis, and these changes
will be transmitted to interface 3 and the PH. The combined effects of
curvature stress imposed by a short dynamin helical assembly coupled
with PH insertion into the membrane are likely to destabilize the membrane
and result in membrane fission. Given the strong similarities
between dynamins and other DRP family members, the structure of
dynamin and our proposed model will serve to guide studies on the
mechanisms of action of DRPs in diverse cellular functions.


METHODS SUMMARY 


The conserved Dyn1 ΔPRD assembly-defective mutant G397D was identified by
a cytological assay using the yeast mitochondrial fission DRP Dnm1-GFP. Dyn1
ΔPRD and Dyn1 G397D ΔPRD were expressed in E. coli and were purified as
described in Methods. Light scattering, sucrose density gradient centrifugation,
mass determination, continuous GTPase assays and electron microscopy were
performed as described in Methods. Crystals were grown by microbatch from
3.2 µl droplets containing 52.5 mM Tris/Cl pH 7.7, 175 mM NaCl, 32.5 mM
NaNO3, 20% v/v PEG 400, 0.97 mM β-mercaptoethanol and 31.9 µM Dyn1
G397D ΔPRD and cryo-protected with Paratone-N. Reflection data were collected
at 100 K at Beamline 8.3.1 at the Advanced Light Source (Berkeley,
California, USA) at a wavelength of 0.9488 Å. Data collection and processing
are described in Methods. The structure was determined by molecular replacement,
using known structures of the nucleotide-free rat dynamin 1 GTPase
domain (PDB accession no. 2AKA chain B), the human dynamin 1 PH domain
(PDB accession no. 2DYN chain B) and a truncated form of the human MxA stalk
(PDB accession no. 3LJB chain B) as sequential search models. Structure refinement
is described in Methods. The stalk interface 2 dimer was fit into the previously
described cryo-electron microscopic reconstruction of GMPPCP-bound
dynamin by applying tilt and twist to the sequential dimers from the linear
filaments observed in the crystal to match the helical parameters described for
the reconstruction. The previously described GTPase domain dimer formed in
the presence of GDP·AlF4− was subsequently fit into the density. Connection of
the GTPase domain to the stalk required a conformational rearrangement of the
BSE, which was independently fit into the visible density. The fit was subjected to
rigid body refinement as described in Methods.
Gravitational redshift of galaxies in clusters as
predicted by general relativity 


Radoslaw Wojtak¹, Steen H. Hansen¹ & Jens Hjorth¹


The theoretical framework of cosmology is mainly defined by gravity,
of which general relativity is the current model. Recent tests of general
relativity within the Lambda Cold Dark Matter (ΛCDM) model have
found a concordance between predictions and the observations of
the growth rate and clustering of the cosmic web. General relativity
has not hitherto been tested on cosmological scales independently of
the assumptions of the ΛCDM model. Here we report an observation
of the gravitational redshift of light coming from galaxies in clusters
at the 99 per cent confidence level, based on archival data. Our
measurement agrees with the predictions of general relativity and
its modification created to explain cosmic acceleration without the
need for dark energy (the f(R) theory), but is inconsistent with
alternative models designed to avoid the presence of dark matter.

According to the theory of general relativity, light emitted from
galaxies moving in the gravitational potential well of galaxy clusters is
expected to be redshifted proportionally to the difference in gravitational
potential Φ between the clusters and an observer, that is,
$z_{\rm gr} = \Delta\Phi/c^2$, where c is the velocity of light in vacuum. For typical
cluster masses of $\sim 10^{14}M_\odot$, where $M_\odot$ is the Sun’s mass, the gravitational
redshift is estimated to be $cz_{\rm gr} \approx 10$ km s$^{-1}$, which is around
two orders of magnitude smaller than the Doppler shift owing to the
random motions of galaxies in clusters. The method of disentangling
the kinematic Doppler effect from gravitational redshift relies on the
fact that the former gives rise to a symmetric broadening of the
observed velocity distribution, whereas the latter shifts its centroid.
A critical factor in detecting such a velocity shift is the number of
galaxies with spectroscopically measured velocities and the number
of galaxy clusters. Both should be sufficiently high to reduce the error
due to the Doppler width of the velocity distribution and to eliminate
the sensitivity to irregularities in cluster structure, such as substructures
and asphericity.
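A rough feel for why large numbers of galaxies are needed comes from the standard error of a mean, σ/√N. The sketch below is an illustration only: the velocity dispersion of 600 km s⁻¹ is an assumed round value (not a number from this paper), the function names are hypothetical, and the estimate ignores the non-cluster background and all systematic effects.

```python
import math

def standard_error_of_mean(sigma_kms, n_galaxies):
    """Statistical error on the velocity centroid for a sample of n_galaxies."""
    return sigma_kms / math.sqrt(n_galaxies)

def galaxies_needed(shift_kms, sigma_kms, n_sigma=3.0):
    """Smallest N for which shift_kms equals n_sigma standard errors."""
    return math.ceil((n_sigma * sigma_kms / shift_kms) ** 2)

# Assumed dispersion of 600 km/s; target centroid shift of 10 km/s.
n = galaxies_needed(shift_kms=10.0, sigma_kms=600.0)
print(n, round(standard_error_of_mean(600.0, n), 2))
```

Under these assumptions, a shift of order 10 km s⁻¹ only becomes distinguishable from the Doppler broadening once tens of thousands of velocities are stacked, which is why the signal is sought in a combined sample of many clusters rather than in any single one.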

The data are compiled from the SDSS Data Release 7 and the
associated Gaussian Mixture Brightest Cluster Galaxy catalogue
containing the positions and redshifts of galaxy clusters identified in the
survey. The cluster sample is richness-limited with a threshold corresponding
to a cluster mass of $10^{14}M_\odot$. The mean, 5th percentile and
95th percentile values of the cluster richness are 16, 8, and 86 and
correspond to cluster masses of around $2\times10^{14}M_\odot$, $10^{14}M_\odot$ and
$10^{15}M_\odot$, respectively. The typical number of spectroscopic redshifts
per cluster (within a 6-megaparsec (Mpc) aperture and a velocity range
of ±4,000 km s$^{-1}$ around the mean cluster velocities) varies from 10
for low-richness clusters to 140 for the richest ones.

Figure 1 shows the histograms of galaxy velocities calculated in four 
bins of the projected cluster-centric distance centred at 0.6, 1.6, 3.3 and 
5.2 Mpc. The cluster centres and redshifts were approximated by the 
coordinates and redshifts of the brightest cluster galaxies. The 
observed velocity distributions consist of two clearly distinct parts: a 
quasi-flat distribution of galaxies not belonging to the clusters 
(observed due to projection effect) and a quasi-Gaussian component 
associated with galaxies gravitationally bound to the clusters. The
latter is expected to reveal the signature of gravitational redshift in
terms of a systematic shift of its velocity centroid. Analysis of mock
kinematic data generated from cosmological simulations shows that
the number of redshifts and clusters is sufficient to reduce all expected
sources of noise (such as substructures, cluster asphericity, and the
non-negligible offset between the brightest cluster galaxies and cluster
centres, both in position on the sky and in redshift space) and
to allow for detection of gravitational redshift at a confidence level of
nearly 3σ (see Supplementary Information).

Figure 1 | Velocity distributions of galaxies combined from 7,800 SDSS
galaxy clusters. The line-of-sight velocity (v_los) distributions are plotted in four
bins of the projected cluster-centric distances R (indicated in the upper left
corner for the distributions from top to bottom), and offset vertically by an
arbitrary amount for presentation purposes. Red lines present the histograms of
the observed galaxy velocities in the cluster rest frame and black solid lines show
the best-fitting models. The model assumes a linear contribution from the
galaxies that do not belong to the cluster and a quasi-Gaussian contribution
from the cluster members (see Supplementary Information for more details).
The cluster rest frames and centres are defined by the redshifts and the positions
of the brightest cluster galaxies. The error bars represent Poisson noise.

Figure 2 | Constraints on gravitational redshift in galaxy clusters. The effect
manifests itself as a blueshift Δ of the velocity distributions of cluster galaxies in
the rest frame of their brightest cluster galaxies. Velocity shifts were estimated
as the mean velocity of a quasi-Gaussian component of the observed velocity
distributions (see Fig. 1). The error bars represent the range of the Δ parameter
containing 68% of the marginal probability and the dispersion of the projected
radii in a given bin. The blueshift (black points) varies with the projected radius
R and its value at large radii indicates the mean gravitational potential depth in
galaxy clusters. The red profile represents theoretical predictions of general
relativity calculated on the basis of the mean cluster gravitational potential
inferred from fitting the velocity dispersion profile using the assumptions of the
most reliable anisotropic model of galaxy orbits (see Supplementary
Information for more details). Its width shows the range of Δ containing 68% of
the marginal probability. The blue solid and dashed lines show the profiles
corresponding to two modifications of standard gravity: f(R) theory and the
TeVeS model. Both profiles were calculated on the basis of the corresponding
modified gravitational potentials (see Supplementary Information for more
details). The prediction for f(R) represents the case that maximizes the
deviation from the gravitational acceleration in standard gravity on the scales of
galaxy clusters. Assuming isotropic orbits in fitting the velocity dispersion
profile lowers the mean gravitational depth of the clusters by 20%. The resulting
profiles of gravitational redshift for general relativity and f(R) theory are still
consistent with the data and the discrepancy between the prediction of TeVeS and
the measurements remains nearly the same. The arrows show characteristic
scales related to the mean radius r_v of the virialized parts of the clusters.

We search for gravitational redshift by measuring the mean velocity
Δ of the quasi-Gaussian component of the observed velocity distribution.
We carry out a Monte Carlo Markov Chain analysis of the data
using a two-component model for the velocity distribution which
includes contributions from both the cluster and non-cluster galaxies
(Supplementary Information). Constraints on the mean velocity are
obtained by marginalizing the likelihood function over the set of
nuisance parameters defining the shape of both components of the
velocity distribution. The best-fitting models of the velocity distributions
are shown in Fig. 1 and the resulting measurements of the mean
velocity as a function of the projected cluster-centric distance R are
presented in Fig. 2. The obtained mean velocity is negative at all radii
with a clear tendency to decline with increasing radius. The negative
values arise from the fact that the rest frames of the clusters are defined
by the observed velocities of the central galaxies. This choice of the
reference frame implies that the gravitational redshift manifests itself
as a blueshift (negative mean velocity) varying with the projected
cluster-centric distance from 0 at the cluster centre to −|Φ(0)|/c at
large projected radii R.
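
The two-component description of the velocity distribution can be sketched in code. The example below is a minimal illustration and not the analysis behind Fig. 2: it assumes a pure Gaussian for the cluster members (the text specifies a quasi-Gaussian), a background that is linear in velocity over ±4,000 km s⁻¹, and it replaces the Markov chain analysis and marginalization with a coarse grid scan over the shift at fixed nuisance parameters. The function names, the mock sample and its parameters (dispersion, membership fraction, true shift) are all hypothetical.

```python
import math
import random

V_MAX = 4000.0  # km/s, half-width of the velocity window quoted in the text


def model_pdf(v, delta, sigma, f_cluster, slope=0.0):
    """Density of one velocity v under the two-component model.

    Cluster members: Gaussian with mean delta and dispersion sigma
    (truncation at +/-V_MAX is neglected, which assumes sigma << V_MAX).
    Non-members: linear in v, normalised on [-V_MAX, V_MAX]; needs |slope| < 1.
    """
    gauss = math.exp(-0.5 * ((v - delta) / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))
    background = (1.0 + slope * v / V_MAX) / (2.0 * V_MAX)
    return f_cluster * gauss + (1.0 - f_cluster) * background


def log_likelihood(velocities, delta, sigma, f_cluster, slope=0.0):
    """Sum of log densities over all galaxies."""
    return sum(math.log(model_pdf(v, delta, sigma, f_cluster, slope)) for v in velocities)


# Mock sample with made-up parameters: 80% members, 20% interlopers.
random.seed(1)
TRUE_DELTA, SIGMA = -8.0, 600.0
sample = [random.gauss(TRUE_DELTA, SIGMA) for _ in range(8000)]
sample += [random.uniform(-V_MAX, V_MAX) for _ in range(2000)]
sample = [v for v in sample if abs(v) < V_MAX]

# Coarse grid scan over the shift, with the other parameters held fixed.
grid = range(-40, 41, 2)
best = max(grid, key=lambda d: log_likelihood(sample, d, SIGMA, 0.8))
print("best-fitting shift on the grid (km/s):", best)
```

With only 8,000 mock members the statistical scatter of the recovered shift is several km s⁻¹, comparable to the signal itself, which illustrates why the full sample of clusters is required.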

The detection of gravitational redshift is significant at the 99% con- 
fidence level. The integrated signal within the 6-Mpc aperture amounts 
to Δ = −7.7 ± 3.0 km s⁻¹, which is consistent with the gravitational
potential depths of simulated galaxy clusters of Δ = −(5–10) km s⁻¹
(ref. 10). A more quantitative comparison with theoretical predictions 
requires explicit information about the mean gravitational potential 
profile and the distribution of cluster masses in the sample. We make 
use of the velocity dispersion profile of the composite cluster to con- 
strain both functions. Then we calculate the gravitational redshift in 
terms of the mean velocity Δ by convolving the individual profiles of
the clusters with their mass distribution (Supplementary Information). 
The resulting profile (red profile in Fig. 2; see also discussion on the 
effect of the anisotropy of galaxy orbits in Supplementary Information) 
is fully consistent with the gravitational redshift inferred from the 
velocity distributions. The fact that the same gravitational potential 
underlies galaxy motions and gravitational redshift of photons in clusters 
provides observational evidence of the equivalence principle on the scale 
of galaxy clusters. 
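
The quoted confidence level can be checked against the integrated signal with a short calculation. The sketch below assumes a Gaussian error distribution and a two-sided test, which is a simplification of the marginalized likelihood used in the analysis; the function name is hypothetical.

```python
import math


def detection_confidence(value, error):
    """Return (z, confidence) for a measurement `value` +/- `error`.

    z is the distance from zero in units of the error; confidence is the
    two-sided Gaussian probability that a null signal would lie within +/- z.
    """
    z = abs(value) / error
    return z, math.erf(z / math.sqrt(2.0))


# Integrated signal within the 6-Mpc aperture, as quoted in the text (km/s).
z, conf = detection_confidence(-7.7, 3.0)
print(f"{z:.2f} sigma, two-sided confidence {100 * conf:.1f}%")
```

Under these assumptions the signal lies about 2.6 standard deviations from zero, consistent with the 99% confidence level stated above.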

We confront the obtained constraints on gravitational redshift with 
the predictions of alternative theories of gravity. We consider two 
popular models of gravity, the tensor-vector-scalar (TeVeS) theory
and the f(R) model, designed to alleviate the problem of dark matter or
to recover the expansion history of the Universe, respectively.
Theoretical profiles of gravitational redshift are calculated using the
relations between the generalized gravitational potentials of these
models and the Newtonian potential (Supplementary Information). 
The Newtonian potential is inferred from the observed velocity dis- 
persion profile of the composite cluster using the assumptions of the 
most reliable anisotropic model of galaxy orbits (see Supplementary 
Information for more details), and constitutes the reference basis for 
the calculations. For TeVeS we assume that the total masses of galaxy 
clusters make up 80% of those recovered under assumption of the 
Newtonian gravity. This factor lowers the ratio of dynamical-to- 
baryonic mass in galaxy clusters to the value resulting from fitting 
Modified Newtonian Dynamics (of which TeVeS is a relativistic
generalization) to cluster data. The resulting profile of gravitational
redshift does not match the data, deviating from the observations at
the 95% confidence level (the blue dashed line in Fig. 2). This discrepancy
increases with projected radius and is mostly caused by a logarithmic
divergence of the scalar field in the regime of small accelerations, that is,
g < a₀ and a₀ ≈ 10⁻¹⁰ m s⁻², which is responsible for a 1/r modification
of the gravitational acceleration. This result points to a critical problem
for the TeVeS (or Modified Newtonian Dynamics) model in recovering
the true gravitational potential at large distances around the cluster
centres. Considering the f(R) model, we choose the least favourable set
of free parameters maximizing the departure from Newtonian gravitational
acceleration. Despite this choice, the resulting profile of gravitational
redshift is consistent with the data (the blue solid line in Fig. 2).
The obtained constraints on gravity are consistent with recent tests
verifying the concordance between gravity, the cosmological model and
observations of the large-scale structure of the Universe. An important
advantage of using the gravitational redshift effect is that this method
does not depend on cosmology (see also Supplementary Information),
allowing gravity to be probed in a direct way. In particular, this implies that
the discrepancy between TeVeS theory and the observations is unlikely
to be a consequence of a specific choice of cosmological parameters, but
indeed points to the inadequacy of this model to describe the Universe
on very large scales.

Figure 3 | The measured-to-predicted ratio of the gravitational redshift.
The figure shows the results of different observations or experiments as a
function of the spatial scale of the gravitational potential well. Blue and red
symbols refer to detections of gravitational redshift z_g in: a ground-based
experiment (blue circle), observations of the Sirius B white dwarf (blue
triangles), a space-based experiment (blue square), an observation of the Sun
(blue star), and analysis of the cluster data reported in this work (red circle). All
measurements are divided by the predictions of general relativity (filled
symbols). Results obtained for galaxy clusters are also compared with the
predictions of f(R) theory and TeVeS model (red open symbols). As a measure
of gravitational redshift in galaxy clusters we used the signal integrated within
the aperture of 6 Mpc. The green square and circle show the measurement of
the rate of growth of cosmic structure γ (ref. 1) and the probe of gravity E_G,
which combines the properties of galaxy-galaxy lensing, galaxy clustering and
galaxy velocities. Both results are compared with the prediction of general
relativity with a standard ΛCDM cosmological model. All error bars represent
standard deviations. The relative accuracy of the measurement from the
space-borne experiment is beyond the resolution of the plot and amounts to 10⁻⁴.

Our results complement a series of experiments and observations 
aimed at confirmation of the predicted gravitational redshift on dif- 
ferent scales of the Universe. Figure 3 shows a summary of the detec- 
tions in terms of the relative accuracy of the measurements as a 
function of the scale of the gravitational potential well. The positions 
of data points vary from 20 m, for the first ground-based experiment,
to the 1–10-Mpc scale for galaxy clusters. On the scale of
the solar radius we plot the measurement of gravitational redshift for
the Sun, and on a scale two orders of magnitude smaller we plot
constraints from the observations of the Sirius B white dwarf and
a space-borne hydrogen maser. These results make gravitational redshift
the only effect predicted by general relativity that has been confirmed
on spatial scales spanning 22 orders of magnitude. To study this
effect in more detail requires a larger redshift sample and therefore will 
be possible with the advent of the next-generation redshift surveys, 
such as the EUCLID satellite.

The excitation of solar-like oscillations in a δ Sct star
by efficient envelope convection

Delta Scuti (δ Sct) stars are opacity-driven pulsators with masses
of 1.5–2.5 M⊙, their pulsations resulting from the varying ionization
of helium. In less massive stars such as the Sun, convection
transports mass and energy through the outer 30 per cent of the
star and excites a rich spectrum of resonant acoustic modes. Based
on the solar example, with no firm theoretical basis, models predict
that the convective envelope in δ Sct stars extends only about 1 per
cent of the radius, but with sufficient energy to excite solar-like
oscillations. This was not observed before the Kepler mission, so
the presence of a convective envelope in the models has been
questioned. Here we report the detection of solar-like oscillations
in the δ Sct star HD 187547, implying that surface convection
operates efficiently in stars about twice as massive as the Sun, as
the ad hoc models predicted.

Thirty days of continuous observations of HD 187547 (KIC 7548479) 
by the Kepler mission with a cadence of 1 min led to its identification as a
δ Sct pulsator (Fig. 1a, b). In contrast to the non-uniformly distributed
signals at low frequencies, the observed regularly spaced peaks at high
frequencies (Fig. 1c) suggest that we also observe high-radial-order overtones,
as expected for stochastically excited solar-like oscillations. For
such oscillations the observed comb-like frequency structure (with the
large frequency separation Δν indicating the frequency separation
between consecutive radial overtones of like degree) is the result of
mainly radial and dipolar pulsation modes, whereas for δ Sct stars it
is not clear which modes are excited to observable amplitudes. The 
strikingly broadened structures observed only at high frequencies 
(Figs 1f and 2b, c) suggest that each is due either to single damped 
and stochastically re-excited oscillations or to very close unresolved 
frequencies of coherent oscillations. 

Here we use spectroscopic observations to derive an effective temperature
T_eff = 7,500 ± 250 K, a surface gravity of log g = 3.90 ± 0.25
dex (c.g.s.) and a projected rotational velocity of v sin i = 10.3 ± 2.3
km s⁻¹ (see Supplementary Information for details). We identify
HD 187547 as an Am star from chemical element abundance analysis, 
which is consistent with the observed low v sin i typical for these stars. 
Am stars are stars of spectral type A showing atmospheric under- 
abundance when compared with the Sun in the chemical elements 
Sc and Ca, and an overabundance of Ba, Sr and Y (ref. 7). We compute 
a photospheric metallicity (all elements except H and He) of Z = 0.017, 
which is larger than the solar value of Z = 0.0134 (ref. 8). 

About two-thirds of Am stars are primary components of spectroscopic
binary systems. The Am phenomenon is connected to slow
rotation, which is not common in A-type stars. Binarity is believed to
act as a braking mechanism slowing down the rotation and allowing
spectral peculiarities to occur as a result of element diffusion.
Pulsating Am stars still represent a challenge to theory, because He
is expected to settle gravitationally and should only partly be present in
the He II ionization zone where the δ Sct pulsations are excited. In
other words, theoretical models predict that the hottest and youngest
A-type stars should not pulsate, which is in contradiction with recent
observations. As the stars evolve, their convective envelopes deepen
and efficiently mix the stellar matter, erasing the observed chemical
peculiarities in the atmospheres, allowing the opacity mechanism to
drive pulsation in the He II ionization zone. Using the observed solar-
like oscillations reported here, the depth of the convective envelope can 
be derived (hence the mixing length), probing the diffusion of He and 
heavy elements in this star. This will contribute significantly to revising 
the interaction between pulsation and diffusion in models of Am stars. 

Seven radial velocity measurements of HD 187547, spread over 
153 days, give no evidence for a short-period binary system. In addi- 
tion, the absence of any detectable contribution by a potential close 
companion to the spectrum implies a considerably less luminous star 
of spectral type G or later. The expected amplitudes and frequency of 
maximum oscillation power for such a star are inconsistent with the 
observations, leading to the conclusion that the signal observed in 
Fig. 1c cannot originate from a companion. The observed amplitude 
spectrum of HD 187547 is not affected by a background star because 
the fraction of light in the aperture from neighbouring stars is only 
1.5%. Other chemically peculiar pulsating stars situated, like the δ Sct
stars, in the classical instability strip in the Hertzsprung–Russell diagram
are the rapidly oscillating Ap stars. Their high-radial-order
pulsation modes are triggered by the opacity mechanism acting in
the hydrogen ionization zone, often showing equidistant multiplets
in the frequency spectrum as a result of the alignment of the pulsation
axes with strong magnetic fields. The strong magnetic fields as seen in
rapidly oscillating Ap stars are, however, not observed in Am stars.
We therefore exclude the possibility that HD 187547 is a hybrid of a
δ Sct and a rapidly oscillating Ap star.

Figure 1 | Fourier amplitude spectra of the Kepler light curve of
HD 187547. a, Fourier spectrum covering the entire frequency range in which
significant signals were observed with a dominant frequency at 251 μHz and an
amplitude of 2 mmag, typical for a δ Sct star. b, The multimode oscillations of
HD 187547 are shown by subtracting 16 sinusoids corresponding to the most
prominent oscillations, revealing a large number of additional significant
frequencies. c, The region between 500 and 850 μHz shows a clear pattern of
roughly equally spaced peaks, which we interpret as high-order consecutive
radial overtones. The comb-like structure expected for high-order radial
overtones is clearly visible. The broadened peaks suggest damped/re-excited
solar-like oscillations. The black arrows denoted Δν indicate the large
separation between consecutive radial and dipole modes. d, Spectral window.
The shape of the window function is defined by the length and sampling of the
data set. Any coherent signal will have the same profile. e, Example for one of
the modes driven by the opacity mechanism in HD 187547. f, A supposed
solar-like oscillation mode observed in HD 187547, displaying a broadened structure
suggestive of a short mode lifetime.

Figure 2 | Time–Fourier spectrum. Here we again highlight the difference in
temporal variability between the modes interpreted as stochastic modes and the
coherent, opacity-driven peaks at low frequencies. The time–Fourier spectrum
was computed with a running filter of full-width at half-maximum = 5 days,
comparable to the mean mode lifetime. a, An opacity-driven mode (the same as
in Fig. 1e) showing temporal stability in the δ Sct frequency region. b, Stochastic
mode observed in HD 187547, showing an erratic behaviour as expected for
solar-like oscillations (the same as in Fig. 1f). c, For comparison, a stochastic
oscillation mode observed in the Sun. The solar data were obtained from the
SOHO VIRGO instrument. The data set has the same length and sampling as
for HD 187547; that is, 30 days and 1 min, respectively. Further details of
frequency analyses and tests on artificial data sets (Supplementary Fig. 1) to
verify our interpretation are in the Supplementary Information.

In Fig. 3 we show an échelle diagram comparing the observed
frequencies with a model of a star similar to HD 187547, demonstrating
again the clear structures separated by Δν at high frequencies
and the non-structured distribution at lower frequencies. For the
high-frequency modes we derive a mean large frequency separation
Δν of 40.5 ± 0.6 μHz. Using the empirical relation
$\Delta\nu = (0.263 \pm 0.009)\,\mu\mathrm{Hz}\,(\nu_{\max}/\mu\mathrm{Hz})^{0.772 \pm 0.005}$ we obtain a
frequency of maximum power ν_max = 682 ± 43 μHz. This is in very good
agreement with the highest-amplitude mode in the supposed stochastic
frequency region at 696 μHz. The possibility that what we observe is
0.5Δν in the frequency spectrum is ruled out because this would require
a ν_max at about 1,673 μHz, where no signal is observed. We can also
exclude the observation of 2Δν because that would place ν_max at
277 μHz, close to the dominant opacity-driven mode at 251 μHz.
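
The three values of the frequency of maximum power discussed above follow from inverting the empirical relation between the large separation and the frequency of maximum power. The sketch below uses only the central values of the coefficient (0.263 μHz) and exponent (0.772) and ignores their uncertainties; the function name is hypothetical.

```python
COEFF_UHZ = 0.263  # coefficient of the empirical relation, in microhertz
EXPONENT = 0.772   # exponent of the empirical relation


def nu_max_from_delta_nu(delta_nu_uhz):
    """Invert delta_nu = COEFF * nu_max**EXPONENT (both in microhertz)."""
    return (delta_nu_uhz / COEFF_UHZ) ** (1.0 / EXPONENT)


OBSERVED_SPACING = 40.5  # microhertz, measured spacing of the high-frequency peaks

# Three readings of the observed spacing and the large separation each implies.
cases = {
    "spacing is the large separation": OBSERVED_SPACING,
    "spacing is half the large separation": 2.0 * OBSERVED_SPACING,
    "spacing is twice the large separation": OBSERVED_SPACING / 2.0,
}
for label, delta_nu in cases.items():
    print(f"{label}: nu_max = {nu_max_from_delta_nu(delta_nu):.0f} microhertz")
```

With central values only, the three cases come out within about 1 μHz of the figures quoted in the text; small differences reflect rounding of the inputs.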

The amplitudes of solar-like oscillations are determined by the inter- 
action between driving and damping defined by different physical processes,
such as modulation of the turbulent momentum and heat fluxes
by pulsation. The exact contribution to driving and damping by each of
these processes is still not well understood, resulting in uncertainties in
the predictions of the stochastically excited mode amplitudes, particularly
in hotter stars in which the convective envelopes are shallow. We
expect the mixing length, the amplitudes and mode lifetimes to constrain
the anisotropy of the convective velocity field, parameters that all
semi-analytical convection models rely on.

For HD 187547 we measure a peak-amplitude per radial mode for
the assumed stochastic signal of 56 ± 2 p.p.m., which after bolometric
correction results in 67 ± 3 p.p.m. (see Supplementary Information
for details). From the empirical scaling relation and using a bolometric
solar peak-amplitude of 3.6 p.p.m. (ref. 21) we obtain a predicted peak
amplitude of A = 14 ± 9 p.p.m. The mean mode lifetimes are measured
as 5.7 ± 0.8 days. Empirical relations predict a mode lifetime
for a star with T_eff = 7,500 ± 250 K of the order of one day or shorter,
which is not in agreement with what we measure for HD 187547.
However, these scaling relations (for amplitude and mode lifetimes) 
are based on few observed stars, and none of them is calibrated in the 
temperature domain of our target, for which the physical conditions in 
the convection zone are expected to be very different. Furthermore, 
given that HD 187547 is metal overabundant in comparison with the 
Sun, the observed amplitude is expected to be higher than predicted
from simple scaling, which is indeed the case. The power of a mode is
directly proportional to the mode lifetime provided that the energy
supply rate over the mode inertia is constant, which further supports
the higher amplitude because the observed mode lifetimes are also
longer than expected. An additional factor that is not considered in
any scaling relations is the chemical peculiarity of our target. In summary,
these factors make HD 187547 an intriguing case for further
theoretical analyses of stochastic oscillations and the potential interaction
with the opacity mechanism in δ Sct stars.

Figure 3 | Échelle diagram of HD 187547. Here we plot 69 extracted
frequencies as a function of frequency modulo the large separation
(Δν = 40.5 μHz). Frequencies equally spaced by Δν will form vertical ridges in
the échelle diagram. To guide the eye, we show theoretically predicted
frequencies of pulsation modes for a 1.85 M⊙ stellar model. Ridges of l = 0
modes are represented by open circles, l = 1 by open triangles and l = 2 by open
diamonds. Detailed modelling of the star is beyond the scope of this paper. The
supposed solar-like modes (filled squares) between 500 and 870 μHz show clear
ridges, as expected for high-order acoustic oscillations similar to what is
observed in solar-like stars. The lower frequencies that we attribute to δ Sct
pulsation (filled circles) excited by the opacity mechanism show no obvious
regular patterns.

The amplitude distribution for stochastic pulsation can be described 
as a Rayleigh distribution, provided that the examined time series are 
much shorter than the mode lifetimes. The relation between the mean 
amplitude ⟨A⟩ and its standard deviation σ(A) can then be written as
$\sigma(A) = (4/\pi - 1)^{0.5}\,\langle A\rangle \approx 0.52\,\langle A\rangle$. This is not valid for opacity-driven
pulsation. For HD 187547 we therefore expect to obtain two different
regimes of the ratio σ(A)/⟨A⟩ for the two groups of oscillation modes
(Supplementary Fig. 2). Indeed, we see that the δ Sct frequencies have a
lower value of σ(A)/⟨A⟩ than the supposed solar-like modes, giving
further evidence for the stochastic nature of the latter (see Supplemen- 
tary Information for details). 
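
The Rayleigh ratio of about 0.52 can be reproduced numerically. The sketch below is an illustration only: it draws amplitudes as the modulus of two independent unit-variance Gaussian components (one standard way of generating Rayleigh-distributed values) and compares the sample ratio with the analytic expression. The sample size and seed are arbitrary choices.

```python
import math
import random
import statistics

# Analytic ratio sigma(A)/<A> for a Rayleigh distribution.
analytic_ratio = math.sqrt(4.0 / math.pi - 1.0)

# Monte Carlo check: the modulus of two independent Gaussian components
# follows a Rayleigh distribution.
random.seed(0)
amplitudes = [
    math.hypot(random.gauss(0.0, 1.0), random.gauss(0.0, 1.0))
    for _ in range(50000)
]
sample_ratio = statistics.pstdev(amplitudes) / statistics.mean(amplitudes)

print(f"analytic ratio: {analytic_ratio:.3f}")
print(f"sample ratio:   {sample_ratio:.3f}")
```

A perfectly coherent mode of constant amplitude would instead give a ratio of zero, which is the contrast exploited in the comparison of the two groups of modes.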

We cannot strictly exclude the possibility that the signals between 
578 and 868 μHz are due to unresolved modes of pulsation excited by
the opacity mechanism, because high-radial-order acoustic modes can
also be observed in hot δ Sct stars. Nevertheless, as shown in Fig. 1 this
would imply that δ Sct pulsation covers the region between 205 and
870 μHz continuously. According to current theory, the opacity mechanism
acting in the He II ionization zone cannot excite modes spanning
16 radial orders for a star with parameters like those of HD 187547
(ref. 28). Further support for the discovery of solar-like oscillations
comes from spectroscopic observations that also indicate the presence
of convective motions in the atmospheres of A and Am stars. In
addition, signatures of granulation noise in δ Sct stars have been
reported from photometric measurements. Opacity-driven pulsations
are also observed in more massive stars (8–16 M⊙), known as β Cephei
stars (in this case the opacity mechanism acts in the ionization region
of the iron-group elements). The unexpected detection of solar-like
oscillations in such a star (with a mass of 10 M⊙) suggests that
both types of pulsation, opacity-driven and stochastically excited,
can coexist and can have overlapping frequency domains. The similar
timescales of the different oscillation types imply a possible interaction
between the two mechanisms.

High-fidelity projective read-out of a solid-state spin quantum register

Initialization and read-out of coupled quantum systems are essential
ingredients for the implementation of quantum algorithms.
Single-shot read-out of the state of a multi-quantum-bit (multi- 
qubit) register would allow direct investigation of quantum correla- 
tions (entanglement), and would give access to further key resources 
such as quantum error correction and deterministic quantum teleportation.
Although spins in solids are attractive candidates for
scalable quantum information processing, their single-shot detection
has been achieved only for isolated qubits. Here we demonstrate
the preparation and measurement of a multi-spin quantum
register in a low-temperature solid-state system by implementing 
resonant optical excitation techniques originally developed in 
atomic physics. We achieve high-fidelity read-out of the electronic 
spin associated with a single nitrogen-vacancy centre in diamond, 
and use this read-out to project up to three nearby nuclear spin 
qubits onto a well-defined state. Conversely, we can distinguish
the state of the nuclear spins in a single shot by mapping it onto,
and subsequently measuring, the electronic spin. Finally, we show
compatibility with qubit control: we demonstrate initialization, 
coherent manipulation and single-shot read-out in a single experi- 
ment on a two-qubit register, using techniques suitable for exten- 
sion to larger registers. These results pave the way for a test of 
Bell’s inequalities on solid-state spins and the implementation of 
measurement-based quantum information protocols. 

The electronic spin of the nitrogen–vacancy centre (NV) in diamond
constitutes an exceptional solid-state system for investigating
quantum phenomena, combining excellent spin coherence with a
robust optical interface. Furthermore, the host nitrogen nuclear
spin (typically ¹⁴N, with nuclear spin I = 1) and nearby isotopic
impurity ¹³C nuclei (I = 1/2) have hyperfine interactions with the
NV’s electronic spin (S = 1), allowing development of few-spin
quantum registers that have been suggested as building blocks for
quantum repeaters, cluster state computation and distributed
quantum computing. All of these applications require high-fidelity
preparation, manipulation and measurement of multiple spins. There
have been significant advances in coherent control over few-spin systems
in diamond, but no method exists for the simultaneous preparation
and single-shot read-out of multi-spin registers, which
impedes progress towards multi-qubit protocols. Here we remove this
obstacle by exploiting resonant excitation techniques, as pioneered in
atomic physics, in microstructured diamond devices that allow
high photon collection efficiency (Fig. 1a). These new methods enable 
us to initialize multiple nuclear spin qubits and to perform single-shot 
read-out of a few-qubit register, clearing the way towards implementa- 
tion of quantum algorithms with solid-state spins. 

Our preparation and read-out techniques rely on resonant excita- 
tion of spin-selective optical transitions of the NV, which can be 
spectrally resolved at low temperatures**. We use the E, and A; 
transitions in our experiments (Fig. 1b): A, connects the ground states 
with spin projection ms = +1 to an excited state with a primarily 


ms = £1 character, whereas E, connects states with ms = 0. A typical 
spectrum of NV A, one of the two NVs we study, is shown in Fig. 1c 
(see Supplementary Information for NV B). Under resonant excitation 
of a single transition, the fluorescence decays with time owing to a 
slight spin mixing within the excited states that induces shelving into 
the other spin state (Fig. 1d). This optical pumping mechanism allows 
high-fidelity spin state initialization’*’’: from the data in Fig. 1d, we 
estimate a preparation error into the ms=0 ground state of 
0.3 + 0.1%, which is a drastic reduction of the 11 + 3% preparation 


a 150 b : 
10 eS — } 
86 ; a 
+ ae 3 
22 a A, 
0.1 Me 
Ee, rod ‘ 
BN \ pe 
Ps 
mg=0 mg= +1 
c d 
115 ee 
NVA 
110 


\s 
-~ 


a 
onanon 


Intensity (kct s~') 


0 2 4 6 8 10 12 14 0 10 20 30 40 50 


Laser detuning (GHz) Time (us) 


Figure 1 | Resonant excitation and electronic spin preparation of a 
nitrogen-vacancy centre. a, Scanning electron microscope image of a solid 
immersion lens representative of those used in the experiments (for details, see 
Supplementary Information). The overlaid sketch shows the substitutional 
nitrogen and the adjacent vacancy that form the NV. Inset, scanning confocal 
microscope image of NV A (logarithmic colour scale). kct, 1,000 counts. 

b, Energy levels used to prepare and read out the NV’s electronic spin (S = 1 in 
the ground and optically excited states); transitions are labelled according to the 
symmetry of their excited states. Dashed lines indicate spin-non-conserving 
decay paths. MW, microwave transition. c, Photoluminescence excitation 
spectrum of NV A; frequency is given relative to 470.443 THz. d, Fluorescence 
time trace of NV A, initially prepared in ms = 0 (E, excitation, 4.8-nW power) 
and ms = + 1 (A, excitation, 7.4-nW power; inset), with a saturation power 
Psat ~ 6 nW. Spin flips in the excitation cycle lead to nearly exponential decay of 
fluorescence, with fitted spin-flip times of 1/)p = 8.1 + 0.1 pts for E, and 

0.39 + 0.01 ts for A;, and initial respective intensities of 740 + 5 and 

95 + 2kcts ', giving a lower limit of 99.7 + 0.1% to the mg = 0 preparation 
fidelity and one of 99.2 + 0.1% to the ms = £1 preparation fidelity. The low 
initial intensity for A, is associated with a fast intersystem crossing to 
metastable singlet states (Supplementary Information). 
error observed with conventional off-resonant initialization 
(Supplementary Information). 

Spin-dependent resonant excitation also allows single-shot electronic 
spin read-out: the presence or absence of fluorescence under 
Ex excitation reveals the spin state. By working with low-strain NVs at 
low temperature (T = 8.6 K), we suppress spin mixing and 
phonon-induced transitions within the excited states, extending 
the spin relaxation time under Ex excitation to several microseconds. 
Together with a high collection efficiency due to the use of solid 
immersion lenses fabricated around pre-selected, low-strain NVs, 
and efficient rejection of resonant excitation from the measured 
phonon-sideband emission, this highly spin-preserving transition 
allows the detection of several photons before the spin flips. 

We demonstrate single-shot read-out by initializing the electronic 
spin to have ms = 0 or ms = ±1, followed by resonant excitation of the 
Ex read-out transition for tro = 100 µs (Fig. 2a). The resulting histograms 
of the number of detected photons are given in Fig. 2b. As 
expected, for ms = ±1 we observe negligible excitation, with a 98.3% 
probability of not measuring any photons during the probe interval. By 
stark contrast, after initializing the spin to have ms = 0 we detect an 
average of ⟨n0⟩ = 6.4 photons per shot. We assign the state ms = 0 to 
detection of one or more photons, and ms = ±1 to the detection of no 
photons. After truncating our integration window to the optimal duration 
of 40 µs, we find an average fidelity of 


$$F_{\mathrm{avg}} = \tfrac{1}{2}\left(F_{m_s=0} + F_{m_s=\pm 1}\right) = 93.2 \pm 0.5\%$$

where F_{ms} is the probability of obtaining the measurement outcome 
ms after optical pumping into ms. To verify that these measurement 
outcomes indeed correspond to the electronic spin states, we use 
single-shot read-out to observe spin Rabi oscillations and 
microwave-induced quantum jumps (Fig. 2d,e). 
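
The threshold assignment described above (one or more photons means ms = 0, no photons means ms = ±1) can be sketched in a few lines. This is a minimal illustration, not the authors' analysis code: the function name `readout_fidelity` is hypothetical, and the synthetic counts are drawn from Poisson distributions, which is an assumption. Real photon-count histograms deviate from Poisson statistics because the spin can flip during read-out, so the number this sketch produces is not expected to match the reported 93.2%.

```python
import numpy as np

def readout_fidelity(counts_bright, counts_dark, threshold=1):
    """Average fidelity of a photon-count threshold read-out.

    counts_bright: photon counts per shot after preparing the bright state (ms = 0).
    counts_dark:   photon counts per shot after preparing the dark state (ms = +-1).
    A shot is assigned to the bright state if counts >= threshold.
    """
    counts_bright = np.asarray(counts_bright)
    counts_dark = np.asarray(counts_dark)
    f_bright = np.mean(counts_bright >= threshold)  # P(outcome bright | prepared bright)
    f_dark = np.mean(counts_dark < threshold)       # P(outcome dark | prepared dark)
    return 0.5 * (f_bright + f_dark), f_bright, f_dark

# Synthetic, idealized data (assumption: Poisson-distributed counts).
rng = np.random.default_rng(0)
bright = rng.poisson(6.4, size=10_000)    # mean photon number quoted in the text
dark = rng.poisson(0.017, size=10_000)    # chosen so that P(0 photons) is about 98.3%
f_avg, f_bright, f_dark = readout_fidelity(bright, dark)
print(f"F_avg = {f_avg:.3f}, F_bright = {f_bright:.3f}, F_dark = {f_dark:.3f}")
```

Applied to measured histograms, the same function could be evaluated for several integration windows to locate the window that maximizes the average fidelity.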

Whereas the full read-out optically pumps the spin, shorter read- 
out durations can be non-destructive, albeit at lower fidelity. By 
optimizing integration windows, we obtain a fidelity of 83.4 ± 0.5% 
for each of two successive read-out segments (Fig. 2c). Correlations 
Figure 2 | Projective single-shot read-out of the NV’s electronic spin. 
a, Pulse sequence used for electronic spin read-out: after charge initialization 
(532 nm; not shown) the electron is pumped into the ms = 0 (A1, dark red) or 
ms = ±1 (Ex, bright red) state, and then undergoes optional microwave spin 
manipulation and spin read-out by a pulse resonant with Ex. Conditioning on 
simultaneous resonance during the final charge- and detuning-sensing stage 
eliminates effects of local electric field fluctuations or ionization 
(Supplementary Information). b, Statistics of photon counts detected during a 
tro = 100-µs electronic spin read-out after initialization into the respective 
ms = ±1 (red) and ms = 0 (superimposed light grey and inset) states, obtained 
from 10,000 measurement repetitions. c, When the 100-µs read-out pulse is 
divided into two read-out segments, R1 and R2, with a variable division point, 
the fidelity of two consecutive segments reaches 83.4 ± 0.5% for an optimal 
division time of 5.5 µs; the probability of identical sequential outcomes is 
82.0 ± 0.7%. Error bars (2 s.e., n = 10,000) are smaller than the symbols. 
d, Electronic spin Rabi oscillations between ms = 0 and ms 1 at axial 
magnetic field Bz ≈ 15 G (purple): each data point comes from 1,000 single-shot 
read-out repetitions. The fit, which includes the detailed hyperfine level 
structure, yields a visibility of 78 ± 8%, where a maximum of 84% can be 
expected. Blue data points show the measurement outcome after projection 
into ms = 0 by selecting only read-out events with photons detected within the 
first 400 ns (Supplementary Information). All errors and error bars, 2 s.e. 
e, Quantum jumps in the fluorescence time trace during continuous spin read-out. 
Durations of dark periods depend on the microwave Rabi frequency 
(Supplementary Information). Blue data indicate the counts per 5-µs read-out 
bin, and the deduced spin state is shown in orange. 
between measurement outcomes indicate that the read-out is projective. 
Following preparation of a superposition of spin states, we condition 
on detection of at least one photon (that is, measurement 
outcome ms = 0) during a first short read-out pulse, and probe the 
resulting spin state with a second read-out (Fig. 2d, blue data points). 
Regardless of the initial spin state, we observe a constant high prob- 
ability of obtaining ms = 0 in the second read-out. This shows that the 
read-out method is strongly projective and well suited for application 
in measurement-based quantum protocols. 

We use projective read-out of the electronic spin in combination 
with quantum gate operations for initialization and read-out of a few- 
qubit nuclear spin register. We first demonstrate the concept of mea- 
surement-based preparation on a single nuclear qubit. The electronic 
spin resonance spectrum for NV B (Fig. 3a, green trace) reveals the 
coupling to the host I = 1 ¹⁴N nuclear spin: two partly overlapping sets 
of three hyperfine lines correspond to the ms = 0 ↔ −1 and 
Figure 3 | Nuclear spin preparation and read-out. a, Measurement-based 
preparation of a single ¹⁴N nuclear spin. In Earth’s ambient magnetic field of 
~0.5 G, without nuclear spin polarization we observe four resonances in the 
hyperfine spectrum (green trace) for NV B; the outer two correspond to the 
nuclear spin state with mI = −1 and the central two are combinations of the 
states with mI = {0, +1}. PL, photoluminescence intensity. Red and brown 
traces indicate transitions to the ms = +1 and ms = −1 states, respectively. As 
indicated in the circuit diagram, to initialize the nuclear spin we entangle it with 
the electronic spin and then read out the latter; p denotes the number of 
preparation steps, each of which is one iteration of the section of the circuit 
diagram enclosed in parentheses. Mcond indicates the conditioning 
measurement, Me the electron spin read-out and UMW the microwave spin 
manipulation. Data for the mI = −1 preparation is shown in orange; data for 
the mI = {0, +1} preparation is shown in cyan. Fits to Gaussian spectra show an 
amplitude ratio of 96 ± 4% in the desired nuclear spin state. b, Measurement-based 
preparation of a three-nuclear-spin register. Using a similar sequence 
(circuit diagram), we prepare a well-defined state for all three nuclear spins. A 
portion of the uninitialized hyperfine spectrum (green) contains 12 partly 
superposed lines, of which we prepare the single line corresponding to 
ms = 0 ↔ +1 electronic spin transitions, Zeeman-split by ~2 MHz 
in Earth’s magnetic field. The outermost transitions are associated with 
a specific nuclear spin state with spin projection mI, for example 
(ms, mI) = (0, −1) ↔ (−1, −1) at 2.874 GHz. Our initialization procedure 
works as follows (Fig. 3a, circuit diagram). First we prepare the electronic 
spin in ms = ±1. We then perform a nuclear-spin-controlled 
NOT operation on the electronic spin by applying a π-pulse at 
2.874 GHz; this operation rotates the electronic spin into ms = 0 only 
when mI = −1. Finally we read out the electronic spin for 400 ns. If one 
or more photons are detected during this interval, the two-spin system 
is projected into (ms, mI) = (0, −1). Alternatively, if we run the same 
protocol with initial electronic spin state ms = 0, we prepare the nuclear 
spin with mI = {0, +1}. 
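
The logic of this measurement-based preparation can be illustrated with a small Monte Carlo sketch. It is a toy model under stated assumptions, not the authors' simulation: the function name `prepare_nuclear_spin`, the per-attempt detection probability `p_detect` and the false-click probability `p_false` are hypothetical values chosen only for illustration, and gate errors and optically induced nuclear spin flips are ignored.

```python
import numpy as np

def prepare_nuclear_spin(n_runs, p_detect=0.3, p_false=0.002, seed=0):
    """Toy model of heralded preparation of mI = -1.

    Each run starts from an equal mixture of mI = -1, 0, +1 with the electron
    in the dark state. An ideal controlled-NOT makes the electron bright only
    if mI = -1; a photon click during the short read-out heralds success.
    p_detect: assumed probability of >= 1 photon when the electron is bright.
    p_false:  assumed probability of a click when the electron is dark.
    Returns (success probability, fraction of heralded runs with mI = -1).
    """
    rng = np.random.default_rng(seed)
    m_i = rng.integers(-1, 2, size=n_runs)        # values -1, 0, +1 with equal weight
    bright = m_i == -1                            # effect of the ideal controlled-NOT
    p_click = np.where(bright, p_detect, p_false)
    click = rng.random(n_runs) < p_click          # heralding photon detected?
    kept = m_i[click]                             # post-selected runs
    return click.mean(), np.mean(kept == -1)

success, purity = prepare_nuclear_spin(200_000)
print(f"success probability = {success:.3f}, heralded purity = {purity:.3f}")
```

The sketch makes the trade-off explicit: a short read-out keeps the heralded state pure at the cost of a low success probability per attempt.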

The efficiency of the nuclear spin initialization is evidenced by its 
drastic effect on the electronic spin resonance spectrum (Fig. 3a). 
Whereas before preparation the depths of the different hyperfine lines 
mI = (−1, 1/2, 1/2) (orange). Gaussian fits constrained to known hyperfine 
splittings yield an amplitude ratio of 88 ± 10%. The observed visibility can be 
improved by performing two preparation steps and electronic spin repumping 
(p = 2; five red data points), yielding a contrast of 82% of the expected visibility 
from known read-out fidelity (Supplementary Information). Uncertainties and 
error bars, 2 s.e. (n = 1,000 for a and b with p = 1; selected from 10,000 
measurement runs for b with p = 2). c, Single-shot measurement of the ¹⁴N 
nuclear spin, preceded by two preparation steps (p = 2). Read-out (three 
repetitions) conditioned on successful preparation distinguishes mI = −1 
(orange; threshold <1 count) from mI = {0, +1} (cyan) with an average fidelity 
of 92 ± 2% (Supplementary Information). d, Multiple-nuclear-spin read-out. 
Using a sequence similar to that in c (inset), we distinguish one of the 12 
hyperfine states associated with NV A. To prepare nuclear spin states, we 
perform the read-out procedure seven times and keep only data with zero total 
counts (identified as mI = (−1, 1/2, 1/2)) or ≥2 counts per initialization step 
(other states). Subsequent read-out with six repetitions (mI = (−1, 1/2, 1/2); 
state discrimination threshold, <3 total counts) achieves a 96.7 ± 0.8% average 
fidelity for preparation and detection of the nuclear spin configuration. 
indicate an equal mixture of the nuclear spin states (green trace), after 
preparation only the hyperfine lines corresponding to the prepared 
states are visible (mI = −1 for the orange trace and mI = {0, +1} for 
the cyan trace). 

The same nuclear spin initialization scheme can be applied to multi- 
qubit registers. Figure 3b shows the electronic spin resonance spec- 
trum of NV A (green trace), whose electronic spin is coupled to both 
the host ¹⁴N nuclear spin and two nearby ¹³C nuclei (Supplementary 
Information). The lowest-frequency line corresponds to a single state 
of the three nuclear spins. A π-pulse on this transition therefore implements 
a triple-controlled NOT operation on the electronic spin 
(Fig. 3b, circuit diagram), allowing the initialization of all three nuclear 
spins (Fig. 3b, orange trace). The initialization can be further improved 
by repeating the preparation step (Fig. 3b, red data points). 

The nuclear qubits can be read out in a single shot by applying a 
nuclear-spin-controlled NOT operation to the electronic spin and 
subsequently reading out the electronic spin (Fig. 3c, d, inset circuit 
diagrams). Because the back-action of the electronic spin measure- 
ment on the nuclear spin is weak, we can repeat the process to obtain 
higher read-out fidelity. Figure 3c compares the resulting photon 
statistics for NV B after initialization into the single nuclear spin state 
mI = −1 with those obtained for mI = {0, +1}, and indicates an average 
read-out fidelity of 92 ± 2%. This number is a lower bound on the 
true read-out fidelity, as it includes errors in state preparation. 

A straightforward extension of this scheme can be used to read out 
the complete state of a register of multiple nuclear spins. By using a 
multiply controlled NOT gate in the read-out sequence, we can measure 
in a single shot whether the register is in a particular configuration. 
We demonstrate this procedure on NV A, where we identify the 
three-nuclear-spin state mI = (−1, 1/2, 1/2) (Fig. 3d and Supplementary 
Information). The other possible configurations can be probed by 
sequential application of this read-out scheme to different spectrally 
resolved hyperfine transitions, or, alternatively, by systematically 
flipping the nuclear spin qubits and repeating the read-out on the same 
hyperfine transition. 

Electronic-nuclear flip-flop processes in the optically excited state, 
which reduce the nuclear spin read-out fidelity, pose a significant 
hurdle to scaling the read-out to more qubits. Critically, resonant 
read-out allows selection of which states undergo optical excitation. 
By starting with the electronic spin in an ms = ±1 state, optical excitation 
will occur only when the register is in the state being probed; 
therefore, no optically induced nuclear spin flips will occur during 
measurement of any of the other states. Thus, by contrast with schemes 
depending on off-resonant excitation where each additional read-out 
step degrades the fidelity, resonant excitation allows scaling of high- 
fidelity read-out to larger registers. 

Finally, we demonstrate the compatibility of all the different tech- 
niques discussed here by implementing them in a single experiment: 
we initialize, coherently manipulate and then read out a two-qubit 
register consisting of the electronic spin and ¹⁴N nuclear spin of NV 
B. After initializing it to have (ms, mI) = (0, −1), we rotate the nuclear 
spin using a radio-frequency pulse and subsequently rotate the 
Figure 4 | Initialization, manipulation and read-out of a two-qubit register. 
a, After initialization of NV B into the (ms, mI) = (0, −1) state, we use 
radio-frequency excitation (spin rotation RRF at 4.9464 MHz) to drive the nuclear 
spin and then microwaves (spin rotation RMW at 2.8774 GHz) to drive the 
electronic spin. The electronic spin state is subsequently measured for 15 µs, 
and this is followed by five read-out steps (each of 10 µs) of the ¹⁴N nuclear spin 
state. b, Probability of observing ms = 0 conditional on the measured nuclear 
spin state and averaged over all radio-frequency pulse durations, as a function 
of microwave pulse duration (Supplementary Information). c, Probability of 
observing mI = −1 conditional on the observed electronic spin state and 
averaged over all microwave pulse durations, as a function of radio-frequency 
pulse duration (Supplementary Information). Error bars and uncertainties, 
2 s.e.; data in a are based on 1,000 measurements per pixel. 
electronic spin with a microwave pulse. We then read out the electronic 
spin and, subsequently, the ¹⁴N nuclear spin state (Fig. 4a, 
circuit diagram). The left-hand plot in Fig. 4a shows the read-out 
results for the electronic spin qubit, showing Rabi oscillations as a 
function of microwave pulse length. By contrast, the read-out results 
for the nuclear spin qubit (right-hand plot of Fig. 4a) show Rabi 
oscillations as a function of the radio-frequency pulse length. 

To quantify crosstalk, we closely examine correlations between the 
two measurement outcomes. We observe that the electronic spin Rabi 
oscillation depends on the outcome of the nuclear read-out (Fig. 4b), 
but this dependence can be fully accounted for by the finite microwave 
power used in this experiment (Supplementary Information). The 
observed correlations thus arise from imperfect manipulation rather 
than from measurement crosstalk. For the nuclear spin read-out, 
however, true measurement crosstalk appears: nuclear Rabi oscillation 
amplitudes decrease when the electronic spin is measured to be in 
ms = 0 (Fig. 4c) because optical excitation during electronic spin 
read-out (which only succeeds for ms = 0) induces nuclear spin relaxa- 
tion (see Supplementary Information for details). This effect can be 
mitigated by improving the collection efficiency (thus reducing the 
read-out duration), for example by integrating the NV into an optical 
cavity. Also, application of moderate magnetic fields can decrease the 
optically induced nuclear spin relaxation rate by orders of magnitude. 

Our results have implications for a broad range of spin-based appli- 
cations. Single-shot electronic spin read-out can drastically improve 
NV-based sensors by allowing fast, quantum-projection-limited 
detection, creating opportunities in low-temperature magnetometry. 
Extension of nuclear spin preparation techniques to remote 
nuclei in the spin bath may permit line-narrowing for enhanced sensi- 
tivity to d.c. magnetic fields. Furthermore, the preparation, manipula- 
tion and single-shot read-out of two spins opens the door to the 
exploration of two-particle quantum correlations, such as Bell’s 
inequalities, and elementary quantum information processing proto- 
cols. Importantly, the techniques we describe are extendable to larger 
spin registers, and can be combined with precise spin qubit control and 
dynamical decoupling to give coherence protection. The preparation 
and read-out fidelities reported here are sufficient for demonstrating 
measurement-based entanglement generation and quantum 
teleportation of spin qubits, and for exploring elementary quantum 
error correction schemes. Ultimately, the integration of multi-spin 
registers with quantum optical channels by means of spin-photon 
entanglement may allow their application as few-qubit nodes in 
long-distance quantum communication protocols or distributed 
quantum information processing networks. 


METHODS 


All data were obtained by detecting photons emitted into the phonon sideband 
(wavelength, 650-750 nm). For photoluminescence excitation spectroscopy, 
excitation with 5.5-nW red light is applied while microwaves at 2.878 GHz, 
coupled through an on-chip stripline, drive the electronic spin transitions to 
prevent optical pumping. Scans are recorded in a single laser frequency sweep 
at ~200 MHz s⁻¹, and are preceded by a 10-µs pulse of 532-nm excitation 
(50 µW). The green light is necessary to reset the negative-charge state of the 
NV, which can be photoionized by continuous resonant excitation. For all other 
experiments, 532-nm-induced spectral diffusion must also be controlled: to ensure 
that the NV is on resonance with the red excitation lasers, we condition our data on 
strong fluorescence upon simultaneous Ex and A1 excitation following the 
experimental sequence (details in Supplementary Information). All errors and error bars 
are 2 s.e. statistical uncertainty in the mean (95% confidence interval). 


Received 11 June; accepted 28 July 2011. 
Published online 21 September 2011. 


1. Nielsen, M. A. & Chuang, I. L. Quantum Computation and Quantum Information 
(Cambridge Univ. Press, 2000). 

2. Raussendorf, R., Browne, D. E. & Briegel, H. J. Measurement-based quantum 
computation on cluster states. Phys. Rev. A 68, 022312 (2003). 

3. Elzerman, J. M. et al. Single-shot read-out of an individual electron spin in a 
quantum dot. Nature 430, 431-435 (2004). 


578 | NATURE | VOL 477 | 29 SEPTEMBER 2011 


4. Vamivakas, A. N. et al. Observation of spin-dependent quantum jumps via 
quantum dot resonance fluorescence. Nature 467, 297-300 (2010). 

5. Neumann, P. et al. Single-shot readout of a single nuclear spin. Science 329, 
542-544 (2010). 

6. Morello, A. et al. Single-shot readout of an electron spin in silicon. Nature 467, 
687-691 (2010). 

7. Giedke, G., Taylor, J. M., D’Alessandro, D., Lukin, M. D. & Imamoğlu, A. Quantum 
measurement of a mesoscopic spin ensemble. Phys. Rev. A 74, 032316 (2006). 

8. Jiang, L. et al. Repetitive readout of a single electronic spin via quantum logic with 
nuclear spin ancillae. Science 326, 267-272 (2009). 

9. Balasubramanian, G. et al. Ultralong spin coherence time in isotopically 
engineered diamond. Nature Mater. 8, 383-387 (2009). 

10. Naydenov, B. et al. Dynamical decoupling of a single-electron spin at room 
temperature. Phys. Rev. B 83, 081201(R) (2011). 

11. de Lange, G., Wang, Z. H., Risté, D., Dobrovitski, V. V. & Hanson, R. Universal 
dynamical decoupling of a single solid-state spin from a spin bath. Science 330, 
60-63 (2010). 

12. Ryan, C. A., Hodges, J. S. & Cory, D. G. Robust decoupling techniques to extend 
quantum coherence in diamond. Phys. Rev. Lett. 105, 200402 (2010). 

13. Togan, E. et al. Quantum entanglement between an optical photon and a 
solid-state spin qubit. Nature 466, 730-734 (2010). 

14. Batalov, A. et al. Temporal coherence of photons emitted by single 
nitrogen-vacancy defect centers in diamond using optical Rabi-oscillations. Phys. Rev. Lett. 
100, 077401 (2008). 

15. Buckley, B. B., Fuchs, G. D., Bassett, L. C. & Awschalom, D. D. Spin-light coherence 
for single-spin measurement and control in diamond. Science 330, 1212-1215 
(2010). 

16. Robledo, L., Bernien, H., van Weperen, I. & Hanson, R. Control and coherence of the 
optical transition of single nitrogen vacancy centers in diamond. Phys. Rev. Lett. 
105, 177403 (2010). 

17. Childress, L., Taylor, J. M., Sørensen, A. S. & Lukin, M. D. Fault-tolerant quantum 
communication based on solid-state photon emitters. Phys. Rev. Lett. 96, 070504 
(2006). 

18. Barrett, S. D. & Kok, P. Efficient high-fidelity quantum computation using matter 
qubits and linear optics. Phys. Rev. A 71, 060310(R) (2005). 

19. Jiang, L., Taylor, J. M. & Lukin, M. D. Fast and robust approach to long-distance 
quantum communication with atomic ensembles. Phys. Rev. A 76, 012301 
(2007). 

20. Dutt, M. V. G. et al. Quantum register based on individual electronic and nuclear 
spin qubits in diamond. Science 316, 1312-1316 (2007). 

21. Neumann, P. et al. Multipartite entanglement among single spins in diamond. 
Science 320, 1326-1329 (2008). 

22. Fuchs, G. D. et al. Excited-state spectroscopy using single spin manipulation in 
diamond. Phys. Rev. Lett. 101, 117601 (2008). 

23. Jacques, V. et al. Dynamic polarization of single nuclear spins by optical pumping 
of nitrogen-vacancy color centers in diamond at room temperature. Phys. Rev. Lett. 
102, 057403 (2009). 

24. Happer, W. Optical pumping. Rev. Mod. Phys. 44, 169-249 (1972). 

25. Blatt, R. & Zoller, P. Quantum jumps in atomic systems. Eur. J. Phys. 9, 250-256 
(1988). 

26. Tamarat, P. et al. Spin-flip and spin-conserving optical transitions of the nitrogen- 
vacancy centre in diamond. N. J. Phys. 10, 045004 (2008). 

27. Atatüre, M. et al. Quantum-dot spin-state preparation with near-unity fidelity. 
Science 312, 551-553 (2006). 

28. Manson, N. B., Harrison, J. P. & Sellars, M. J. Nitrogen-vacancy center in diamond: 
model of the electronic structure and associated dynamics. Phys. Rev. B 74, 
104303 (2006). 

29. Fu, K.-M. C. et al. Observation of the dynamic Jahn-Teller effect in the excited states 
of nitrogen-vacancy centers in diamond. Phys. Rev. Lett. 103, 256404 (2009). 

30. Hadden, J. P. et al. Strongly enhanced photon collection from diamond defect 
centers under microfabricated integrated solid immersion lenses. Appl. Phys. Lett. 
97, 241901 (2010). 

31. Degen, C. L. Scanning magnetic field microscope with a diamond single-spin 
sensor. Appl. Phys. Lett. 92, 243111 (2008). 

32. Taylor, J. M. et al. High-sensitivity diamond magnetometer with nanoscale 
resolution. Nature Phys. 4, 810-816 (2008). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements L.R. acknowledges support by a Marie Curie Intra European 
Fellowship within the 7th European Community Framework Programme. L.R., H.B., B.H. 
and R.H. acknowledge support from the Dutch Organization for Fundamental Research 
on Matter (FOM) and the European Commission (SOLID). L.C. acknowledges support 
from Research Corporation for Science Advancement (RCSA). 


Author Contributions L.R., L.C. and H.B. conducted the experiments. L.R., L.C., H.B., B.H. 
and R.H. analysed the data. H.B. and P.F.A.A. fabricated the devices. L.R., L.C. and R.H. 
wrote the paper. All authors commented on the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of this article at 
www.nature.com/nature. Correspondence and requests for materials should be 
addressed to R.H. (r.hanson@tudelft.nl). 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


doi:10.1038/nature10421 


Interannual variability in the oxygen isotopes of 
atmospheric CO2 driven by El Niño 


Lisa R. Welp¹, Ralph F. Keeling¹, Harro A. J. Meijer², Alane F. Bollenbacher¹, Stephen C. Piper¹, Kei Yoshimura¹, Roger J. Francey³, 
Colin E. Allison³ & Martin Wahlen¹ 


The stable isotope ratios of atmospheric CO2 (¹⁸O/¹⁶O and ¹³C/¹²C) 
have been monitored since 1977 to improve our understanding of 
the global carbon cycle, because biosphere-atmosphere exchange 
fluxes affect the different atomic masses in a measurable way. 
Interpreting the ¹⁸O/¹⁶O variability has proved difficult, however, 
because oxygen isotopes in CO2 are influenced by both the carbon 
cycle and the water cycle. Previous attention focused on the decreasing 
¹⁸O/¹⁶O ratio in the 1990s, observed by the global Cooperative 
Air Sampling Network of the US National Oceanic and Atmospheric 
Administration Earth System Research Laboratory. This decrease 
was attributed variously to a number of processes including an 
increase in Northern Hemisphere soil respiration; a global increase 
in C4 crops at the expense of C3 forests; and environmental conditions, 
such as atmospheric turbulence and solar radiation, that 
affect CO2 exchange between leaves and the atmosphere. Here we 
present 30 years’ worth of data on ¹⁸O/¹⁶O in CO2 from the Scripps 
Institution of Oceanography global flask network and show that the 
interannual variability is strongly related to the El Niño/Southern 
Oscillation. We suggest that the redistribution of moisture and rainfall 
in the tropics during an El Niño increases the ¹⁸O/¹⁶O ratio of 
precipitation and plant water, and that this signal is then passed on 
to atmospheric CO2 by biosphere-atmosphere gas exchange. We 
show how the decay time of the El Niño anomaly in this data set 
can be useful in constraining global gross primary production. Our 
analysis shows a rapid recovery from El Niño events, implying a 
shorter cycling time of CO2 with respect to the terrestrial biosphere 
and oceans than previously estimated. Our analysis suggests that 
current estimates of global gross primary production, of 120 petagrams 
of carbon per year, may be too low, and that a best guess of 
150-175 petagrams of carbon per year better reflects the observed 
rapid cycling of CO2. Although still tentative, such a revision would 
present a new benchmark by which to evaluate global biospheric 
carbon cycling models. 

Quantifying global-scale gross primary production (GPP) has been 
difficult because there are no direct measures at scales greater than the 
leaf level. Consequently, inferences are made from atmospheric-based 
CO2 flux measurements (for example eddy covariance), but these can 
only measure the net ecosystem exchange, that is, the small residual 
between GPP and ecosystem respiration. Satellite-based observations 
estimate GPP on the basis of absorbed photosynthetic radiation. Both 
approaches rely heavily on biospheric process models to generate GPP 
estimates. We present a new approach that does not depend on bio- 
spheric process models, using existing long-term time series of CO2 
isotopes to quantify mean global GPP over the past several decades. 

Stable isotope analysis of CO2 started with flask samples from the 
South Pole (SPO) and Christmas Island stations in 1977 and later 
expanded to include the Alert (ALT), Point Barrow (PTB), La Jolla, Mauna 
Loa (MLO), Cape Kumukahi, American Samoa, Kermadec Islands and 
Baring Head stations. Figure 1 shows deseasonalized monthly mean 
¹⁸O/¹⁶O observations and spline fits at each station expressed using 
delta notation, δ¹⁸O = (¹⁸O/¹⁶O)sample/(¹⁸O/¹⁶O)VPDB − 1, with the 
Vienna Pee Dee Belemnite (VPDB) standard. Interannual variability 
of δ¹⁸O-CO2 (that is, δ¹⁸O of the CO2 samples) is characterized by 
oscillations of 2-10 yr, with relatively consistent timing and magnitude 
of anomalies at all stations and without any persistent long-term trend. 
The lack of a trend in ¹⁸O/¹⁶O contrasts with the increasing trend in 
the CO2 mixing ratio, of roughly 1.5 p.p.m. yr⁻¹, and the decreasing 
trend in δ¹³C = (¹³C/¹²C)sample/(¹³C/¹²C)VPDB − 1, of −0.02‰ yr⁻¹, 
over the same period. This is consistent with the fact that fossil fuel 
emissions contribute little to the ¹⁸O/¹⁶O budget of CO2 and that 
oxygen atoms in CO2 are renewed by exchange with oxygen atoms 
of water in leaves, soils and sea water approximately every 2 yr (refs 2, 
9, 10) or less, as will be discussed below. Fortunately, signal-to-noise 
ratios are generally much larger for δ¹⁸O than for δ¹³C in CO2. 
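
As a worked illustration of the delta notation used above, the helper below converts an isotope ratio into a delta value in per mil. The function name `delta_permil` and the numerical ratios are hypothetical; in particular, the standard ratio in the example is an arbitrary placeholder, not the accepted VPDB value.

```python
def delta_permil(r_sample: float, r_standard: float) -> float:
    """Delta value in per mil: (R_sample / R_standard - 1) * 1000.

    r_sample and r_standard are heavy-to-light isotope ratios
    (for example 18O/16O) of the sample and of the reference standard.
    """
    return (r_sample / r_standard - 1.0) * 1000.0

# Placeholder numbers for illustration only (not real standard ratios).
R_STANDARD = 0.002
R_SAMPLE = R_STANDARD * 1.0005   # sample enriched by 0.05% relative to the standard
print(delta_permil(R_SAMPLE, R_STANDARD))  # about 0.5 per mil
```

A sample with exactly the standard ratio has a delta value of zero, and depleted samples give negative values.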

Cross-correlation analysis determined that δ¹⁸O-CO2 anomalies at 
station SPO lag anomalies at the low-latitude station MLO by 4-5 
months. Stations ALT and PTB, which are at high northern latitudes, 
lag MLO by 3-4 months. This suggests that anomalies start in the 
tropics and propagate to high latitudes over the course of several 
months. Maximum correlation coefficients between SPO and both 
MLO and ALT were approximately 0.4 when allowing for lags between 
stations. 
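
A lagged cross-correlation of this kind can be sketched as follows. This is a generic illustration rather than the authors' procedure: the function name `best_lag` is hypothetical, the two series are assumed to be equally long, evenly sampled (for example monthly) and free of gaps, and only non-negative lags of the second series behind the first are scanned.

```python
import numpy as np

def best_lag(x, y, max_lag):
    """Find the lag (in samples) at which y best correlates with x.

    For each lag k in 0..max_lag, y[t + k] is compared with x[t] using the
    Pearson correlation coefficient. Returns (lag, correlation) at the maximum.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    best = (0, -np.inf)
    for lag in range(max_lag + 1):
        a = x[: len(x) - lag]          # leading series, truncated at the end
        b = y[lag:]                    # lagging series, shifted forward
        r = np.corrcoef(a, b)[0, 1]
        if r > best[1]:
            best = (lag, r)
    return best

# Synthetic check: y is a copy of x delayed by 5 samples.
t = np.arange(240)
x = np.sin(2 * np.pi * t / 48)
y = np.roll(x, 5)
print(best_lag(x, y, max_lag=12))
```

With monthly data, the returned lag is directly the delay in months between the two stations.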

During El Niño/Southern Oscillation (ENSO) events, there are significant 
changes in the amount and intensity of rainfall in the tropics, 
especially over Southeast Asia and northern South America. Changes 
in the amount of precipitation also affect the δ¹⁸O value of water 
through the ‘amount effect’ and related processes. Figure 2 shows 
the correlation between the ENSO precipitation index (ESPI) and the 
δ¹⁸O value of precipitation as hindcast by the IsoGSM model driven 
by reanalysis winds. During El Niño years, the δ¹⁸O value of precipitation 
(δppt) increases in regions where the amount of precipitation 
decreases, such as Southeast Asia and northern South America. These 
are also regions with very high annual net primary production (NPP) 
and GPP. Figure 3a, b shows that δ¹⁸O-CO2 is strongly positively 
correlated with δppt in the tropics, whereas Fig. 3c, d shows that in 
the same regions it is negatively correlated with relative humidity. This 
further supports a link between δ¹⁸O-CO2 and ENSO variability 
through the tropical water cycle. 

The interannual variability of δ¹⁸O-CO2 measured by the Scripps 
Institution of Oceanography (SIO) flask network was confirmed by 
comparison with an independent record of similar length from Cape 
Grim (CGO) station as measured by the Commonwealth Scientific and 
Industrial Research Organisation (CSIRO), also shown in Fig. 1. 
CGO and SPO have very similar correlations with ENSO forcing, 
lagging ENSO by 8-11 months with maximum correlation coefficients 
of 0.4. SIO flask observations were also compared with those of the US 
National Oceanic and Atmospheric Administration Earth System 
Research Laboratory (Supplementary Information, section 1). 


¹Scripps Institution of Oceanography, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093-0244, USA. ²Center for Isotope Research, University of Groningen, Nijenborgh 4, 9747 
AG Groningen, The Netherlands. ³CSIRO Marine and Atmospheric Research, PB 1, Aspendale, Victoria 3195, Australia. Present address: Atmosphere and Ocean Research Institute, University of Tokyo, 5-1- 
Figure 1 | Measurements of δ¹⁸O-CO2 from the SIO flask network and 
CSIRO. Monthly mean deseasonalized δ¹⁸O-CO2 station data (black dots) 
with long-term spline fits (grey lines). All stations (latitude, longitude and 
altitude as shown) are from the SIO flask network with the exception of the 
CSIRO station CGO, at the bottom of the figure. A seasonal harmonic fit was 
subtracted from the monthly means to produce the deseasonalized 
observations presented here. The 1σ mass spectrometer precision of both 
laboratories since 1990 is ~0.014‰. The SIO estimates that the 1σ error of 
duplicate flask measurements is ~0.025‰. Measurement uncertainties for 
both laboratories are larger before 1990. 


We used a simple empirical model to demonstrate that the interannual 
variability in δ¹⁸O-CO₂ is closely related to a low-pass filter of 
ENSO. The model follows from mass balance considerations (see 


580 | NATURE | VOL 477 | 29 SEPTEMBER 2011 


[Figure 2 map. Axes: latitude (60° S to 80° N) and longitude (180° W to 180° E). Colour scale: correlation coefficient, -0.7 to 0.7.] 


Figure 2 | Correlations between precipitation δ¹⁸O from IsoGSM and 
ENSO. We use the ENSO precipitation index (ESPI) as a proxy for ENSO 
variability. Positive correlation indicates that precipitation is enriched in ¹⁸O 
during the El Niño phase. 


Supplementary Information, section 2, for a full derivation), treating 
the troposphere as two well-mixed boxes, one for the Northern 
Hemisphere and one for the Southern Hemisphere (N and S, respectively), 
assuming a linear ENSO forcing on the δ¹⁸O-CO₂ values for 
each hemisphere (δ_N and δ_S, respectively) through the influence of the 
terrestrial biosphere and incorporating non-ENSO background processes 
that effectively damp the ENSO isotope anomalies: 


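$$\frac{d\delta_N}{dt} = f_N \times A \times \mathrm{ENSO}'(t - \mathrm{lag}) - \frac{\delta_N}{\tau_N} + \frac{\delta_S - \delta_N}{\tau_{\mathrm{mix}}} \qquad (1)$$

$$\frac{d\delta_S}{dt} = (1 - f_N) \times A \times \mathrm{ENSO}'(t - \mathrm{lag}) - \frac{\delta_S}{\tau_S} + \frac{\delta_N - \delta_S}{\tau_{\mathrm{mix}}} \qquad (2)$$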


Fitting parameters include f_N, A, τ_N, τ_S and lag, where f_N represents the 
fraction of the total ENSO forcing within the Northern Hemisphere; A 
is the scaling term in units of ‰ ESPI⁻¹ yr⁻¹, and represents the 
conversion of the ESPI anomalies from the mean (ENSO′) into 
δ¹⁸O-CO₂ anomalies from the mean (δ_N′ or δ_S′, depending on hemisphere); 
τ_N and τ_S are the Northern Hemisphere and Southern 
Figure 3 | Correlations between flask δ¹⁸O-CO₂ records and precipitation 
δ¹⁸O and relative humidity from IsoGSM. a, b, Correlation between the 
previous 9-month running mean of precipitation δ¹⁸O and the monthly mean 
deseasonalized δ¹⁸O-CO₂ from station MLO (a) and station SPO (b). Positive 
correlations indicate that δ¹⁸O-CO₂ increases when precipitation δ¹⁸O 
increases. c, d, Correlation between the NPP-weighted, 9-month running mean 
of negative relative humidity and deseasonalized δ¹⁸O-CO₂ from MLO (c) and 
SPO (d). Positive correlations indicate that δ¹⁸O-CO₂ increases when relative 
humidity decreases. NPP-weighted relative humidity was used here to give 
weight to months with higher vegetation productivity when leaf-water isotope 
anomalies are passed onto CO₂. 
Hemisphere turnover times of oxygen atoms in CO₂; and lag is the 
temporal offset between ESPI and the isotopic ENSO forcing. The 
constant τ_mix, which represents the interhemispheric mixing time, 
was set to a value of 12 months. 
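
The behaviour of equations (1) and (2) can be illustrated with a minimal forward-Euler sketch. This is not the authors' code: the function name, the monthly time step, the conversion of A to per-month units and the choice of zero forcing outside the record are all assumptions made for illustration.

```python
def integrate_two_box(enso, f_n, a, tau_n, tau_s, lag, tau_mix=12.0, dt=1.0):
    """Integrate the two-box model of equations (1) and (2) with forward Euler.

    enso    : list of monthly ENSO anomalies (ESPI units)
    f_n     : fraction of the ENSO forcing in the Northern Hemisphere
    a       : scaling term, assumed here in per mil per ESPI per month
    tau_n   : Northern Hemisphere turnover time (months)
    tau_s   : Southern Hemisphere turnover time (months)
    lag     : integer offset in months between ESPI and the forcing
    tau_mix : interhemispheric mixing time (months), 12 as in the text
    """
    delta_n, delta_s = 0.0, 0.0  # both boxes start with no anomaly (assumption)
    out_n, out_s = [], []
    for t in range(len(enso)):
        idx = t - lag
        # Assumption: forcing is zero where the lagged index leaves the record.
        forcing = enso[idx] if 0 <= idx < len(enso) else 0.0
        d_n = (f_n * a * forcing - delta_n / tau_n
               + (delta_s - delta_n) / tau_mix)
        d_s = ((1.0 - f_n) * a * forcing - delta_s / tau_s
               + (delta_n - delta_s) / tau_mix)
        delta_n += dt * d_n
        delta_s += dt * d_s
        out_n.append(delta_n)
        out_s.append(delta_s)
    return out_n, out_s
```

With a short Northern Hemisphere turnover time the northern box tracks the forcing closely, whereas a longer τ_S smooths and delays the southern response, which is the low-pass behaviour described in the text.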

The box model approach assumes that surface δ¹⁸O-CO₂ measurements 
represent the entire tropospheric column. Although aircraft 
measurements have observed variability within the troposphere, the 
fact that the seasonal cycle of δ¹⁸O-CO₂ observed at MLO is similar to 
that in the upper troposphere means that the box model may be a 
sufficient first attempt because we are tracking a signal that originates 
in the tropics and is transported vertically before reaching the mid 
latitudes. 

We alternately used MLO and ALT to represent the Northern 
Hemisphere box, and SPO and CGO for the Southern Hemisphere 
box (Supplementary Information, section 3). We repeated the fits 
using 1,000 hypothetical time series with the same autocorrelation as 
the ENSO index but with random phase. All station combinations 
gave fits to the true ENSO index that were better than a large majority 
of fits using the hypothetical indices, more than 98% of cases using 
the SIO station combinations and more than 93% using SIO-CSIRO 
station combinations (Supplementary Information, section 4). The 
MLO-SPO combination of data from 1980 to 2009 yielded the best 
fit between the ENSO model and observations (Fig. 4). 
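
One common way to build hypothetical series with the same autocorrelation but random phase is Fourier phase randomization. The sketch below assumes that approach; the text does not spell out the authors' procedure, and the function names are hypothetical. A plain discrete Fourier transform is written out so that the example has no dependencies.

```python
import cmath
import math
import random


def dft(x):
    """Naive discrete Fourier transform (adequate for short series)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def inverse_dft(spec):
    """Naive inverse discrete Fourier transform."""
    n = len(spec)
    return [sum(spec[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)) / n
            for t in range(n)]


def phase_randomized_surrogate(series, rng):
    """Return a surrogate with the same amplitude spectrum and random phases.

    Keeping every Fourier amplitude fixed preserves the power spectrum and
    hence the autocorrelation; only the timing of the anomalies is scrambled.
    """
    n = len(series)
    spec = dft(series)
    new_spec = list(spec)
    # Randomize each positive frequency and mirror it onto the negative
    # frequency so that the inverse transform stays real. The mean (k = 0)
    # and, for even n, the Nyquist term are left unchanged.
    for k in range(1, (n - 1) // 2 + 1):
        phase = rng.uniform(0.0, 2.0 * math.pi)
        new_spec[k] = abs(spec[k]) * cmath.exp(1j * phase)
        new_spec[n - k] = new_spec[k].conjugate()
    return [value.real for value in inverse_dft(new_spec)]
```

Refitting the model to many such surrogates gives a reference distribution of fit quality against which the fit to the true index can be ranked.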

The model fit yielded a short turnover time of τ_N = 0.4-0.8 yr for 
δ¹⁸O-CO₂ in the Northern Hemisphere, depending on the stations 
used in the analysis and with 1σ errors not greater than 0.3 yr. 
Sensitivity analysis of the model parameter fit covariance suggested 
that τ_S is greater than 2 yr, but the upper bound was difficult to determine 
because of the 1-yr interhemispheric mixing time (τ_mix). We 
expected τ_N to be less than τ_S given the larger land biosphere and 
the greater exchange of CO₂ with leaf and soil water in the north. 
Estimated turnover times for the two hemispheres based on gross 
exchanges with the ocean and the terrestrial biosphere indicated 
that the Southern Hemisphere turnover time is ~120% longer than 
that of the Northern Hemisphere. The model-derived, hemisphere- 
specific turnover times correspond to a global tropospheric mean of 
Figure 4 | ENSO index and two-box model results. a, ESPI was used as a 
proxy for ENSO variability. Arrows denote El Niño events. b, Empirical ENSO 
model fit for the Northern Hemisphere (δ_N, black line), compared with the 
deseasonalized monthly flask observations (black dots) and spline fit or 
interannual variability at station MLO (grey line). c, Same as in b but for the 
Southern Hemisphere model fit (δ_S) and station SPO. 
0.7-1.4 yr. Allowing for the 22% of the atmospheric mass contained in 
the stratosphere, which was not included in the model, the estimated 
global atmospheric turnover time increases to 0.9-1.7 yr. 

These atmospheric observations suggest that the turnover time for 
oxygen atoms in CO₂ may be slightly lower than previously published 
estimates. Turnover times of oxygen in CO₂ of 1.7 yr (ref. 2) and 1.5 yr 
(ref. 10) have been estimated from bottom-up flux estimates using process 
models. These estimates are sensitive to assumed values for GPP and 
the ratio of CO₂ concentration inside the leaf relative to the atmosphere. 
Our turnover time estimate using δ¹⁸O-CO₂ anomalies is independent 
of these assumptions. Recent efforts to balance the carbonyl sulphide 
budget also point to more exchange of CO₂ between the atmosphere and 
land vegetation than previously estimated. Field studies have indicated 
that carbonic anhydrase in soils catalyses the direct exchange of oxygen 
isotopes between soil water and CO₂. A recent global estimate of this soil 
equilibration flux could bring the global turnover time of ref. 10 down as 
low as ~0.9 yr (ref. 22). 

The parameter A provides insight into the mechanisms linking δ¹⁸O-CO₂ 
to ENSO climate anomalies. The fits yielded A values of 0.27 to 
0.35‰ ESPI⁻¹ yr⁻¹, depending on the stations used, with 1σ errors of 
less than 0.08‰ ESPI⁻¹ yr⁻¹. To assess which processes contribute most 
to A, we carried out a scale analysis of the mass balance of ¹⁸O in CO₂ 
(Supplementary Information, section 5). We identified two dominant 
processes. The first was a change in the δ_ppt value associated with ENSO 
events. Precipitation δ¹⁸O anomalies are directly transmitted to soil and 
leaf water, and hence contribute to the δ¹⁸O value of CO₂. Generally, the 
δ_ppt value on land in the tropics increases during El Niño events, as 
evidenced by isotopic studies of precipitation, tropical ice cores, 
tree rings and isotope-enabled general circulation models. The 
second dominant effect was an increase in the isotopic enrichment 
of leaf water relative to soil water caused by lower relative humidity 
during warm and dry El Niño events in the tropics than in La Niña 
years. 

We estimate that the effects of δ_ppt and relative humidity are similar 
in magnitude and reinforce each other. The magnitude of A can 
be accounted for if we assume that anomalies in δ_ppt and relative 
humidity influenced 40% of the global land CO₂ fluxes. NPP from 
20° S to 20° N is about 40% of global NPP, lending support to the idea that 
the global anomalies of δ¹⁸O in CO₂ may be generated by the tropical 
land biosphere that is modulated by ENSO. Our analysis showed that 
many other possible processes were less important, including changes 
in CO₂ fluxes (for example those related to drought) and fire emissions. 
Our analysis neglected ENSO-related changes in air-sea CO₂ 
exchange, in stratosphere-troposphere exchange and in diffuse light, 
which are also likely to be small. 

The model-derived fraction of isotopic ENSO forcing injected into 
the Northern Hemisphere (f_N) was 0.49-0.76 depending on the station 
combinations used. Lags between ESPI and atmospheric δ¹⁸O-CO₂ 
forcing were small, ranging from -2 months to 1 month, and had 
1σ errors of 1 month. The model thus successfully accounts for the 
observed lag between ESPI and δ¹⁸O-CO₂ through the finite hemispheric 
turnover times. 

We can relate our model-derived turnover time to the gross flux of 
CO₂ equilibrating with the ocean and land surface water. This gross 
exchange flux can be used to estimate GPP by subtracting air-sea 
exchange; the amount of CO₂ that escapes leaf interiors without being 
fixed into organic matter, which is directly related to the ratio of CO₂ 
inside the leaf to that in the atmosphere; and the soil invasion flux 
(Supplementary Information, section 6). Reconciling the short turnover 
time (0.9-1.7 yr) with current understanding of the global carbon 
budget would require that the soil invasion CO₂ flux be near the upper 
case considered in ref. 22 and much greater than soil respiration. At 
present, there are not enough field observations to confirm such high 
fluxes of soil invasion. Alternatively, and more plausibly, the fast response 
can be accounted for by revising global GPP upwards from 
120 Pg C yr⁻¹ (ref. 7) to 150-175 Pg C yr⁻¹. Using this approach, we 
argue that 120 Pg C yr⁻¹ may actually be a lower bound on GPP. (See 
Supplementary Information, section 6, for more discussion.) 

The timescale of the damping of the ENSO δ¹⁸O-CO₂ signal provides 
a constraint on global GPP without the need for detailed isotopic 
modelling of multiple surface water pools. We have achieved with 
δ¹⁸O-CO₂ what was believed to be possible only by combining ¹⁷O 
and ¹⁸O data from CO₂ (ref. 28). These results can be used to validate 
biospheric components of carbon cycle models used to predict future 
climate change. The measurements and modelling presented here 
highlight the significance of δ¹⁸O-CO₂ long-term records as integrators 
of the carbon and water cycles. 


METHODS SUMMARY 


Air samples were collected, without drying, in evacuated 5-litre glass flasks with a 
single glass stopcock lubricated with Apiezon grease, and were returned to the 
laboratory for analysis. Until 1992, isotopic analysis was performed at the CIO 
University of Groningen, The Netherlands; since then, it has been done at the SIO, 
La Jolla, California. Offsets between the laboratories have been investigated and 
corrected. We curve fitted the daily mean of replicate flask samples with spline 
and seasonal harmonic components, and monthly means were constructed from 
observations adjusted to the fifteenth day of each month. The interannual 
variability is simply the spline component of the curve fit. Deseasonalized monthly 
means were calculated by subtracting the seasonal harmonic component from the 
monthly mean observations. 

Parameter fitting of the two-box model was done using a nonlinear least-squares 
data fitting function in MATLAB (‘lsqnonlin’) to find the minimum of the sum of 
squares of the differences between deseasonalized monthly mean flask observations 
from one station in each hemisphere and the model predictions from equations 
(1) and (2), weighted by 1/σ for each station. Estimates of the 1σ error for each 
station’s measurements were not obtained from the mass spectrometer precision 
but rather from the standard deviation of the residuals between the deseasonalized 
monthly mean flask observations and the spline fits, which were ~0.08‰. 
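
The quantity minimized in such a fit can be written out as a short sketch. The function names below are hypothetical and this is an illustration in Python rather than the MATLAB routine named above; a nonlinear least-squares solver would take the residual vector and search over the model parameters.

```python
def weighted_residuals(obs_n, model_n, sigma_n, obs_s, model_s, sigma_s):
    """Stack the 1/sigma-weighted residuals for one station per hemisphere.

    obs_*   : deseasonalized monthly mean observations (per mil)
    model_* : model predictions for the same months (per mil)
    sigma_* : 1-sigma error assigned to each station (per mil)
    """
    res = [(o - m) / sigma_n for o, m in zip(obs_n, model_n)]
    res += [(o - m) / sigma_s for o, m in zip(obs_s, model_s)]
    return res


def sum_of_squares(residuals):
    """Cost that a least-squares solver would drive to a minimum."""
    return sum(r * r for r in residuals)
```

Dividing by σ before squaring means that a station with noisier records contributes less to the cost for the same absolute misfit.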

Isotope data of CO₂ from the SIO network are publicly available at 
http://scrippsco2.ucsd.edu/data/atmospheric_co2.html. 
Transcriptomic analysis of avian digits reveals 
conserved and derived digit identities in birds 


Zhe Wang, Rebecca L. Young, Huiling Xue & Günter P. Wagner 


Morphological characters are the result of developmental gene 
expression. The identity of a character is ultimately grounded in 
the gene regulatory network directing development and thus 
whole-genome gene expression data can provide evidence about 
character identity. This approach has been successfully used to 
assess cell-type identity. Here we use transcriptomic data to 
address a long-standing uncertainty in evolutionary biology, the 
identity of avian wing digits. Embryological evidence clearly 
identifies the three wing digits as developing from digit positions 
2, 3 and 4 (ref. 6), whereas palaeontological data suggest that they 
are digits I, II and III. We compare the transcriptomes of the wing 
and foot digits and find a strong signal that unites the first wing 
digit with the first foot digit, even though the first wing digit 
develops from embryological position 2. Interestingly, our tran- 
scriptomic data of the posterior digits show a higher degree of 
differentiation among forelimb digits compared with hindlimb 
digits. These data show that in the stem lineage of birds the first 
digit underwent a translocation from digit position 1 to position 2, 
and further indicate that the posterior wing digits have unique 
identities contrary to any model of avian digit identity proposed 
so far®*. 

To address the problem of avian digit identity we performed 
mRNA-seq analysis from pools of 40 individually isolated chicken 
forelimb and hindlimb digits at two stages of development: stages 
28/29 and 31 (Fig. 1a-d). A total of 14,692 annotated genes was 
detected and compared among samples. Sampling pools of popula- 
tions are highly reproducible (Supplementary Fig. 1). In describing the 
results we distinguish nominal digit identities, based on position, from 
biological digit identities; that is, those that indicate homologies. 
Nominal digit identities use a three-letter code, the first giving develop- 
mental stage (E, early for stage 28/29; L, late for stage 31) and the 
second giving limb identity (F, forelimb; H, hindlimb). The two letters 
are followed by a lowercase letter giving the anterior-posterior location 
of chondrified digits, with ‘a’ for the most anterior digit and ‘b’, ‘c’ and 
‘d’ for the posterior digits. In contrast, biological digit identities (homo- 
logies) are indicated by roman numerals I, II, III and IV. Finally, 
embryonic digit positions are indicated with Arabic numerals from 
anterior to posterior: 1, 2, and so on. These terminological differences 
are necessary to distinguish descriptive references to certain digits in 
the adult hand (a, b, etc.) from statements about embryological digit 
positions (1, 2, etc.) and biological digit identity or homology (digits I, 
II, etc.). 
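
The three-letter nominal code can be made concrete with a small parser. This is an illustrative sketch only: the names are hypothetical, and the restriction to three forelimb and four hindlimb positions is inferred from the sample labels used in this paper.

```python
from typing import NamedTuple

STAGES = {"E": "early (stage 28/29)", "L": "late (stage 31)"}
LIMBS = {"F": "forelimb", "H": "hindlimb"}
# Assumption: three chondrified digits in the wing, four in the foot.
POSITIONS = {"F": "abc", "H": "abcd"}


class NominalDigit(NamedTuple):
    stage: str
    limb: str
    position: str  # 'a' is the most anterior digit


def parse_nominal_digit(code: str) -> NominalDigit:
    """Decode a nominal digit label such as 'EFa' or 'LHd'."""
    if len(code) != 3:
        raise ValueError(f"expected a three-letter code, got {code!r}")
    stage, limb, position = code
    if (stage not in STAGES or limb not in LIMBS
            or position not in POSITIONS[limb]):
        raise ValueError(f"unknown nominal digit code: {code!r}")
    return NominalDigit(STAGES[stage], LIMBS[limb], position)


def all_sample_codes():
    """List every valid label, stage by stage and limb by limb."""
    return [s + l + p for s in STAGES for l in LIMBS for p in POSITIONS[l]]
```

The code records position only. Whether 'Fa' is digit I or digit II is a statement about homology that the label deliberately leaves open.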

Transcriptomic data have not been systematically applied to address 
questions of digit homology. At the most general level, the transcrip- 
tome is a phenotypic character and is thus a legitimate source of 
evidence regarding character identity. Gene expression data from a 
limited number of genes have successfully been applied to questions of 
homology in the past. Cell-type identity is another area where gene 
expression has proven to be valuable. Here we refer to homology as 
the historical continuity of the developmental programme that determines 
developmental individuality of the character. By this definition, 
homology is thus tied to continuity in the expression of regulatory 
genes. 

First we performed a multidimensional scaling (MDS) analysis to 
identify the overall similarity structure of the transcriptomes (Fig. 1e). 
After removing the effect of forelimb-hindlimb differences (that is, removing 
differentially expressed genes between forelimb and hindlimb), the 
ordination analysis reveals four clearly resolved clusters correspond- 
ing to early and late anterior digit clusters, and two clusters of early and 
late posterior digits. The transcriptomic data thus contain two strong 
signals: one pertaining to the difference between the first digits and the 
posterior digits, and the other reflecting developmental stages. 

The heat map of correlation coefficients between samples and 
hierarchical clustering analyses based on the expression of all detected 
genes also give strong support for a cluster of anterior digits (Fig. 1f and 
Supplementary Fig. 2). These data indicate that the anterior digits in 
the wing and the hindlimb are homologous (Fa = Ha). There is broad 
agreement that the anterior-most digit in the chicken foot is homo- 
logous to digit I in the ancestral five-digited limb of crown-group 
tetrapods (Ha = DI). We conclude that the most anterior digit in the 
wing is likely to be homologous to digit I. The transcriptomic data are 
consistent with the morphological data that show a continuity of digit I 
identity along the theropod lineage. 

The existence of a cluster that unites the forelimb and hindlimb digits 
I is significant because the first forelimb digit in the chick develops from 
the embryonic position that normally develops into digit II, that is, Fa 
develops from embryonic position F2 whereas Ha develops from 
embryonic position H1 (ref. 18). The difference between the digit I 
cluster and the posterior digits could reflect the posterior to anterior 
gradient in digit development. However, the separation of the anterior 
digit I cluster is also found in stage 31, where the anterior and the 
posterior digits are at comparable levels of differentiation (compare 
Fig. 1c, d). To analyse which genes are driving the digit I cluster, we 
identified 556 significantly differentially expressed genes between digit 
I and the other digits (Supplementary Fig. 3). Gene Ontology (GO) 
analysis of these differentially expressed genes detected seven enriched 
GO terms of molecular function and biological process. Transcription 
factor activity is the top GO term, indicating that the transcriptome 
data sort digits according to their gene regulatory signature (Sup- 
plementary Table 1). Other over-represented GO terms are DNA bind- 
ing, regulation of transcription, and skeletal system development. 
These results indicate that the distinction between the first digits and 
the rest of the digits reflects the developmental individuality of digit I 
relative to the posterior digits; that is, Fa and Ha are biologically equi- 
valent or true homologues, as suggested by palaeontological and in situ 
hybridization (ISH) studies. This result suggests that, in the theropod 
hand, the embryological position of digit I has changed from 
position 1, as in the ancestor of amniotes, to position 2 in the stem 
lineage of birds, consistent with the frame shift hypothesis of avian digit 
identity. 

To gain more insight into the genetic nature of digit I identity we 
searched for digit I markers and identified Hoxd12, Hand2, Zic3 and 
Figure 1 | Chicken embryonic digit transcriptomes identify forelimb and 
hindlimb digits I but no clear correspondence between posterior forelimbs 
and hindlimbs. a-d, Alcian-blue-stained autopods. Red lines indicate the 
dissection protocol used to obtain the samples for the RNA-seq analysis. Scale 
bar, 1 mm. a, b, Stage 28; c, d, stage 31. e, Ordination of transcriptome 


Lhx9. Hoxd12 and Hand2 are well known negative markers of the 
anterior-most digit (Supplementary Figs 4 and 5 and Supplementary 
Discussion). We found two novel positive markers of digit I: Zic3 
and Lhx9 (Fig. 2 and Supplementary Fig. 6). 

Zic3 is always expressed more strongly in the first digit than the 
posterior digits, in particular at stages 26 and 28 (Fig. 2). Zic3 is a 
transcriptional co-factor that physically interacts with Gli3, which is 
important in the transduction of the Shh signal. Digit I has been 
proposed to be refractory to Shh signalling, and the interaction of 
Zic3 protein with Gli3 may have an involvement in this (Supplementary 
Discussion). Lhx9 is a LIM homeodomain transcription factor 
that controls three-dimensional limb patterning in mouse (Supplementary 
Discussion). 

The similarity analysis of digit transcriptomes presented above does 
not determine to which hindlimb digits the second and third wing 
digits correspond. Inspection of the cluster analysis and the correlation 
coefficients shows a pattern of conflicting signals (Fig. 1f and 
Supplementary Fig. 7). There are two well supported clusters that unite 
the second wing digit with the third hindlimb digit, (EFb, EHc) and 
(LFb, LHc), but these two clusters are united with the late second 
hindlimb digit, LHb. There is a cluster (EHd, LHd, LFc) that is indi- 
cative of the third digit in the wing having an affinity to the fourth 
hindlimb, but there is also a cluster that unites the early third forelimb 
digit with the early second hindlimb digit (EFc, EHb), indicating an 
incoherent signal. To identify the source of this ambiguity we propose 
that the degree of differentiation among the posterior digits might 
differ between forelimbs and hindlimbs. 
similarity, after removing forelimb-hindlimb signals. f, Heat map of Pearson’s 
correlation and cluster analysis. Red numbers at the nodes represent 
approximately unbiased bootstrap values. g, Comparison between posterior 
forelimb digits and between neighbouring posterior hindlimb digits by 
Pearson’s correlation coefficients. 


To test this possibility we inspected the correlation coefficients 
between the posterior digits within each limb (Fig. 1g). In the early 
digit samples there is a strong negative correlation between the second 
and third wing digits but a weaker negative correlation between the 
second and third hindlimb digits and a positive correlation between 
the third and fourth hindlimb digits. In the late samples, the correla- 
tion between the second and third wing digits is still negative although 
weaker than for the early samples, whereas the correlations between 
the neighbouring posterior hindlimb digits are both positive. Together, 
these data indicate a much weaker differentiation between neighbour- 
ing posterior hindlimb digits than that between the posterior forelimb 
digits. We also performed differential expression analysis showing that 
the differences between posterior forelimb digits are larger than the 
difference between neighbouring posterior hindlimb digits (Sup- 
plementary Fig. 8). Overall, these results suggest that the second and 
third wing digits diverged and may have acquired derived digit iden- 
tities during theropod evolution, making it difficult to find corres- 
ponding digit identities in the hindlimb. 

To discover genes that specifically contribute to the second and 
third wing digit identities, we performed differential expression ana- 
lysis of the mRNA-seq data between samples LFb and LFc. We found 
two genes, Tbx3 and Socs2, with high expression in sample LFc 
(Supplementary Fig. 9 and Fig. 3a). To our knowledge no studies have 
been published indicating a role for Socs2 in limb development. ISH 
confirms its strong expression in the third forelimb digit to the exclu- 
sion of all other digits in forelimb and hindlimb (Fig. 3b-g). Recently it 
has been shown that the third forelimb digit has a unique mode of 
Figure 2 | Expression of Zic3. a, Expression pattern of Zic3 in mRNA-seq 
samples. The y axis is the trimmed mean of M-values (TMM) normalized 
mRNA-seq read-count value. b-g, Expression of Zic3 in forelimb and 
hindlimb buds visualized by ISH. b, c, Stage 26; d, e, stage 28; f, g, stage 30. The 
dorsal view of the autopod is shown with anterior up and distal to the right. The 
white arrowheads in b-e indicate the expression boundary between first and 
second digit. The red arrows in f, g indicate the proximal expression of Zic3. 
Scale bar, 1 mm. 


development in birds. This, combined with our gene expression 
survey, supports the idea that the third wing digit has a unique derived 
identity in birds. 

The data presented here reveal two patterns. First, there is a strong 
signal that identifies the anterior-most digits in the forelimb and 
hindlimb as homologous, in spite of the fact that they develop in 
different embryological positions. The second pattern is that no clear 
correspondence was found between the posterior forelimb digits and 
the posterior hindlimb digits. A comparison of digits within each limb 
shows that the posterior forelimb digits are more strongly differen- 
tiated than the posterior hindlimb digits, and forelimb digit III exhibits 
a unique expression of Socs2 (Fig. 3d, f and Supplementary Fig. 10). 
These results provide evidence that the second and third wing digits 
Figure 3 | Expression of Socs2. a, Expression pattern of Socs2 in mRNA-seq 
samples. The y axis is the TMM normalized mRNA-seq read-count value. 
b-g, Expression of Socs2 in forelimb and hindlimb digits visualized by ISH. 
b, c, Stage 29; d, e, stage 31; f, g, stage 33. Orientation of limb buds as in Fig. 2. 
The white arrowhead in b points at the expression domain at the third digit. 
Scale bar, 1 mm. 


are developmentally distinct from the second to fourth hindlimb 
digits, indicating that digit individualization can be dynamic and 
evolutionarily transient. 


Fourteen samples and two pools of twenty individuals from embryonic chicken 
digits were used to perform mRNA-seq on an Illumina Genome Analyzer. Digital 
primordia with their closely posterior interdigital mesoderm were dissected separately 
from the forelimbs and hindlimbs between stages 28 and 29 (stage 28/29; 
Fig. 1a, b). Digital rays without the interdigital tissues were collected from the 
forelimbs and hindlimbs at stage 31 (Fig. 1c, d). mRNA-seq data were used for 
clustering, correlation analysis, differential gene expression and Gene 
Ontology analysis. For ISH, we followed a previously published protocol. All 
the probes are labelled with digoxigenin (Roche, catalogue no. 11277073910) and 
hybridized with the chicken embryos at 70 °C. 
METHODS 


Sample collection. Fertilized chicken eggs were obtained from Charles River 
Laboratories International and incubated between 37.5 and 39 °C at high relative 
humidity (>50%). To obtain sufficient RNA for transcriptome sequencing, digit 
samples were collected and pooled from 20 individuals. Previous studies show that 
digit identity is regulated by the closely posterior interdigital mesoderm and that 
digit identities of digit I/II and III are not fixed until stages 30 and 29 in the chicken 
hindlimb. Thus, we use digital primordia with their closely posterior interdigital 
mesoderm to compose a sample for mRNA-seq at stage 28/29, and only the 
digital ray as a sample at stage 31 (Fig. 1a-d). Collections were performed at two 
time points during egg incubation to obtain stage 28/29 and stage 31 embryos. 
Staging is based on Hamburger and Hamilton. Digital primordia with their closely 
posterior interdigital mesoderm were dissected separately from the forelimb and 
hindlimb between stages 28 and 29 (stage 28/29; Fig. 1a, b). Digital rays without 
interdigital tissues were collected from the forelimb and hindlimb at stage 31 
(Fig. 1c, d). In both cases, we removed the tissue anterior to the first digital ray of 
the forelimb. This tissue is thought to be the residual condensation of the ancestral 
digit I. 

mRNA-seq. Total RNA was extracted using the RNeasy Mini kit (Qiagen), and 
treated with DNase I using RNase-Free DNase set (Qiagen). Extracted total RNA 
of the samples was submitted to Yale Center for Genome Analysis for mRNA 
sequencing (mRNA-seq) using an Illumina Genome Analyzer. mRNA-Seq 
8-Sample Prep kit (Illumina), Standard Cluster Generation kit (Illumina) and 
Illumina Sequencing kit (Illumina) were used for constructing libraries for single- 
read sequencing, cluster amplification and final sequencing-by-synthesis. Two flow 
cells, each containing 8 lanes and 8 samples including the Phi X174 control, were 
used. Image analysis, base calling, extraction of 34-bp reads, and read counting were 
performed using the Illumina pipeline. 

Sequence alignment, gene counting and normalization. Reads from mRNA-seq 
data were aligned to Ensembl chicken genome (Gallus gallus WASHUC2) using the 
GERALD module of Illumina CASAVA v1.6 software. After alignment, raw gene 
counts (gene expression levels) were counted by Illumina CASAVA v1.6 software, 
and then normalized using the TMM method, which takes RNA composition bias 
into account, using the edgeR package. The normalization factors were calculated 
by edgeR for each library and then the normalized read counts were 
calculated by the following formula: normalized count = raw count ÷ (library size 
× normalization factor) × 10⁶. The normalized count was directly used for subsequent 
analyses (see below), and was also divided by the cDNA length to adjust for 
gene length and then used in the same analysis. 
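
The normalization formula can be checked with a few lines of arithmetic. The sketch below is not the edgeR implementation; the function names are hypothetical, the normalization factor is taken as a given input, and the scale of 10^6 (counts per million) is an assumption about the exponent in the formula.

```python
def normalized_count(raw_count, library_size, norm_factor, scale=1e6):
    """Scale a raw gene count by the effective library size.

    library_size * norm_factor is the effective library size; the scale
    of one million is an assumed reading of the formula in the text.
    """
    return raw_count / (library_size * norm_factor) * scale


def length_adjusted_count(raw_count, library_size, norm_factor, cdna_length):
    """Variant that also divides by cDNA length to adjust for gene length."""
    return normalized_count(raw_count, library_size, norm_factor) / cdna_length
```

For example, a gene with 700 reads in a library of 7,000,000 reads and a normalization factor of 1.0 has a normalized count of about 100.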

Differential expression analysis. DEGseq packages were used to identify differentially 
expressed genes between digit I (EFa, LFa, EHa and LHa; Fig. 1a-d) and 
all the other digits (EFb, EFc, LFb, LFc, EHb, EHc, EHd, LHb, LHc and LHd; 
Fig. 1a-d), and between LFb and LFc samples (Fig. 1c, d). Function DEGexp was 
performed to identify differentially expressed genes and the R scripts for using this 
function are provided in the User’s Guide of DEGseq. DEGseq was also used to 
identify differentially expressed genes between forelimb (EFa, EFb, EFc, LFa, LFb 
and LFc; Fig. 1a-d) and hindlimb digits (EHa, EHb, EHc, EHd, LHa, LHb, LHc 
and LHd; Fig. 1a-d), and these genes were removed from the input data for the 
MDS plot (see below). Owing to the underestimate of the true variability (resulting 
from pooling individuals), more differentially expressed genes may be removed, 
which actually makes the downstream approach (MDS plot) conservative. A gene 
is considered significantly differentially expressed when the Q-value (adjusted 
P value for multiple testing) is lower than 0.05 (ref. 34). 

MDS plot. The MDS plot is conducted by the edgeR package. The input data are 
normalized read counts of 5,813 genes after removing genes that are significantly 
differentially expressed between forelimb digits and hindlimb digits. The distance 
between each pair of samples in the output figure is the square root of the common 
dispersion for the top 100 genes that best distinguish that pair of samples. These 
top 100 genes are selected according to the tagwise dispersion of all the samples. 

Cluster analysis and heat map of correlation. We produced two sets of data from 
the normalized data by different methods. The two data sets are separately used for 
hierarchical clustering and calculating either Pearson’s correlation coefficient or 
Spearman’s rank correlation coefficient. The first method considers the mRNA- 
seq data to follow a Poisson distribution and calculates the square root values of the 
normalized data (the canonical variance-stabilizing transformation for Poisson). 
To calculate the input data, the 14 samples are divided into four groups: group 1 
(EFa, EFb and EFc), group 2 (EHa, EHb, EHc and EHd), group 3 (LFa, LFb and 
LFc) and group 4 (LHa, LHb, LHc and LHd) depending on developmental stages 
and forelimb/hindlimb. A distance is calculated by subtracting the mean of square 
root values of a group (for example, group 1: all the forelimb digits at stage 28/29) 
from each square root value of a sample (for example, EFa). The distance matrix is 
used for clustering and calculating the correlation coefficient between samples. 

The second method considers that the normal distribution, with mean λ and 
variance λ (standard deviation: the square root of λ), is an approximation to the 
Poisson distribution because of large values of λ (in this study, λ ≈ 700). Thus, a 
standard score (also called z-value, z-score), which indicates how many standard 
deviations a datum is above or below the mean, is used for clustering and calculat- 
ing Pearson’s correlation coefficient between samples. 
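
As a rough illustration of the two transformations described above, the following sketch applies each of them to the normalized counts of a single gene within one sample group and computes Pearson's correlation between two samples. The function names and example counts are hypothetical, and taking the group mean as the Poisson mean for the standard score is an assumption, since the text does not state how the mean was estimated.

```python
import math
from statistics import mean


def sqrt_centred(counts, group):
    """Method 1: square-root transform, then subtract the group mean of the roots."""
    roots = {s: math.sqrt(counts[s]) for s in group}
    group_mean = mean(roots.values())
    return {s: roots[s] - group_mean for s in group}


def z_scores(counts, group):
    """Method 2: standard score under a normal approximation to the Poisson."""
    # Assumption: the Poisson mean (and variance) is estimated by the group mean.
    lam = mean(counts[s] for s in group)
    return {s: (counts[s] - lam) / math.sqrt(lam) for s in group}


def pearson(x, y):
    """Pearson's correlation coefficient between two equal-length value lists."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Hypothetical normalized counts for one gene in group 1 (EFa, EFb, EFc).
    gene_counts = {"EFa": 900.0, "EFb": 625.0, "EFc": 400.0}
    group1 = ["EFa", "EFb", "EFc"]
    print(sqrt_centred(gene_counts, group1))
    print(z_scores(gene_counts, group1))
```

In practice each transformation would be applied gene by gene, and the correlation between two samples would then be computed across all genes from the resulting values.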

A heat map of correlation coefficients between samples was created by the R 
Lattice package. Clustering is conducted by pvclust, an R package for hierarchical 
clustering with P values”. After bootstrap re-sampling (10,000 iterations), the 
approximately unbiased (AU) P value is provided in the output figure. The AU P value, 
which is calculated by multiscale bootstrap re-sampling, is a better approximation to 
the unbiased P value than the bootstrap probability value calculated by ordinary 
bootstrap re-sampling. Clusters are formed using the average dissimilarity between the 
samples. An AU P value higher than 95% indicates that the cluster is highly 
supported”. 

Gene Ontology analysis. Gene Ontology analysis is performed by the GOseq 
package**, which takes gene length bias into account. We first identify the significantly 
differentially expressed genes that are driving the clustering by edgeR package’’ and 
then identify significantly enriched GO categories using a 0.05 cutoff for the false 
discovery rate. Default parameters and methods in the packages are used. 

In situ hybridization. Total RNA was extracted from chicken embryonic limb 
buds using the RNeasy Mini kit (Qiagen), and treated with DNase I using RNase- 
Free DNase set (Qiagen). cDNA was synthesized from the total RNA using the 
High Capacity cDNA Reverse Transcription kit (Applied Biosystems). The whole 
coding sequence of Hand2 was amplified using a pair of specifically designed 
primers (forward primer, 5'-GCGGCGATGAGTCTTGTG-3'; reverse primer, 
5'-CTCACTGCTTGAGCTCCAGC-3'; product, 658 bp). Partial coding sequence 
followed by partial 3' UTR of Zic3 was amplified using a pair of primers previously 
reported but without the additional T7 RNA polymerase promoter sequence 
(forward primer, 5'-CAGCAAGGACTCCACGAAAAC-3'; reverse primer, 
5'-CGACCCCATCAGATGAGAAT-3'; product, 723 bp)**. Most of the coding 
sequence of Lhx9 (914 bp) was amplified using a pair of primers previously 
reported’’. Most of the coding sequence followed by partial 3' UTR of Socs2 
was amplified using a pair of specifically designed primers (forward primer, 
5'-CGTTGCCGAAGCCAAGGAGA-3'; reverse primer, 
5'-AGGGATGCGAGCGGGGATAA-3'; product, 624 bp). PCR products were purified 
using the QIAquick Gel Extraction kit (Qiagen) and cloned into pGEM-T vector 
(Promega). All the antisense probes were prepared by linearizing the plasmid with 
NcoI and transcribing with SP6 polymerase, whereas the sense probes were prepared 
by linearizing the plasmid with SpeI and transcribing with T7 polymerase. 
The probe of Hoxd12 was provided by C. Tabin**. All of the probes are labelled 
with digoxigenin (Roche) and hybridized with the chicken embryos at 70 °C. The 
procedure of ISH followed a previously published protocol’. 

The evolution of the amniotic egg was one of the great evolutionary 
innovations in the history of life, freeing vertebrates from an oblig- 
atory connection to water and thus permitting the conquest of 
terrestrial environments’. Among amniotes, genome sequences 
are available for mammals and birds?“*, but not for non-avian 
reptiles. Here we report the genome sequence of the North 
American green anole lizard, Anolis carolinensis. We find that 
A. carolinensis microchromosomes are highly syntenic with chicken 
microchromosomes, yet do not exhibit the high GC and low repeat 
content that are characteristic of avian microchromosomes’. Also, 
A. carolinensis mobile elements are very young and diverse—more 
so than in any other sequenced amniote genome. The GC content 
of this lizard genome is also unusual in its homogeneity, unlike 
the regionally variable GC content found in mammals and birds’. 
We describe and assign sequence to the previously unknown 
A. carolinensis X chromosome. Comparative gene analysis shows 
that amniote egg proteins have evolved significantly more rapidly 
than other proteins. An anole phylogeny resolves basal branches to 
illuminate the history of their repeated adaptive radiations. 

The amniote lineage divided into the ancestral lineages of mammals 
and reptiles ~320 million years ago. Today, the surviving members of 
those lineages are mammals, comprising ~4,500 species, and reptiles, 
containing ~17,000 species. Within the reptiles, the two major clades 
diverged ~280 million years ago: the lepidosaurs, which contain 
lizards (including snakes) and the tuatara; and the archosaurs, contain- 
ing crocodilians and birds (the position of turtles remains unclear)°. 
For simplicity, we will refer here to lepidosaurs as lizards (Fig. 1). 

The study of the major genomic events that accompanied the trans- 
ition to a fully terrestrial life cycle has been assisted by the sequencing 
of several mammal (K.L.-T. et al., manuscript submitted) and three 
bird genomes**. The genome of the lizard A. carolinensis thus fills an 
important gap in the coverage of amniotes, splitting the long branch 
between mammals and birds and allowing more robust evolutionary 
analysis of amniote genomes. 


For instance, almost all reptilian genomes contain microchromo- 
somes, but these have only been studied at a sequence level in birds”’, 
raising the question as to whether the avian microchromosomes’ 
peculiar sequence features are universal across reptilian microchromo- 
somes*. Another example is the study of sex chromosome evolution. 
Nearly all placental and marsupial mammals share homologous sex 
chromosomes (XY)* and all birds share ZW sex chromosomes. 
However, lizards exhibit either genetic or temperature-dependent 
sex determination’®. Characterization of lizard sex chromosomes 
would allow the study of previously unknown sex chromosomes and 
comparison of independent sex chromosome systems in closely related 
species. 

Anolis lizards comprise a diverse clade of ~400 described species 
distributed throughout the Neotropics. These lizards have radiated, 
often convergently, into a variety of ecological niches with attendant 
morphological adaptations, providing one of the best examples of 
adaptive radiation. In particular, their diversification into multiple rep- 
licate niches on diverse Caribbean islands via interspecific competition 
and natural selection has been documented in detail’’. A. carolinensis is 
the only anole native to the USA and can be found from Florida and 
Texas up to North Carolina. We chose this species for genome sequen- 
cing because it is widely used as a reptile model for experimental ecology, 
behaviour, physiology, endocrinology, epizootics and, increasingly, 
genomics. 

The green anole genome was sequenced and assembled (AnoCar 2.0) 
using DNA from a female A. carolinensis lizard (Supplementary Tables 
1-4). Fluorescence in situ hybridization (FISH) of 405 bacterial arti- 
ficial chromosome (BAC) clones (from a male) allowed the assembly 
scaffolds to be anchored to chromosomes (Supplementary Table 5 and 
Supplementary Fig. 1). The A. carolinensis genome has been reported 
to have a karyotype of n = 18 chromosomes, comprising six pairs of 
large macrochromosomes and 12 pairs of small microchromosomes”’. 
The draft genome sequence is 1.78 Gb in size (see Supplementary 
Table 3 for assembly statistics) and represents an intermediate 
between genome assemblies of birds (0.9-1.3 Gb) and mammals 
(2.0-3.6 Gb). 

Figure 1 | Amniote phylogeny based on protein synonymous sites showing 
major features of amniote evolution. Major characteristics of lizard evolution 
including homogenization of GC content, high sex chromosome turnover and 

We find that few chromosomal rearrangements occurred in the 280 
million years since anole and chicken diverged, as had been hinted at 
by previous comparisons using Xenopus and chicken’’. There are 259 
syntenic blocks (defined as consecutive syntenic anchors that are con- 
sistent in order, orientation and spacing, at a resolution of 1 Mb) 
between lizard and chicken (Supplementary Table 6 and Supplemen- 
tary Fig. 2). Interestingly, 19 out of 22 anchored chicken chromosomes 
are each syntenic to a single A. carolinensis chromosome over their 
entire lengths (Fig. 2a); by contrast, only 6 (of 23) human chromo- 
somes are syntenic to a single opossum chromosome over their entire 
lengths, even though the species diverged only 148 million years ago™*. 
Segmental duplications follow trends seen in other amniote genomes 
(Supplementary Note, Supplementary Table 7 and Supplementary 
Fig. 3). 

Approximately 30% of the A. carolinensis genome is composed of 
mobile elements, which comprise a much wider variety of active repeat 
families than is seen for either bird’ or mammalian’ genomes. The 
most active classes are long interspersed (LINE) elements (27%) and 
short interspersed (SINE) elements (16%)'° (Supplementary Table 8). 
The majority of LINE repeats belong to five groups (L1, L2, CR1, RTE 
and R4) and seem to be recent insertions based on their sequence 
similarity (divergence ranges from 0.00-0.76%; ref. 17). This contrasts 
with observations of mammalian genomes, where only a single family 
of LINEs—L1—has predominated over tens of millions of years. The 
DNA transposons comprise at least 68 families belonging to five super- 
families: hAT, Chapaev, Maverick, Tc/Mariner and Helitron’’. As with 
retrotransposons, the majority of DNA transposon families seem to be 
relatively young in contrast to the extremely few recently active DNA 
transposons found in other amniote genomes (Supplementary Table 9). 
Overall, A. carolinensis mobile elements feature significantly higher GC 
content (43.5%, P< 10° *°) than the genome-wide average of 40.3%. In 
addition to mobile elements, A. carolinensis exhibits a high density 
(3.5%) of tandem repeats, with length and frequency distributions 
similar to those of human microsatellite DNA’*. We now know that 
amniote genomes come in at least three types: mammalian genomes are 
enriched for L1 elements and have a high degree of mobile element 
accumulation, bird genomes are repeat poor with very little mobile 
element activity, while the lizard genome contains an extremely wide 
diversity of active mobile element families but has a low rate of accumulation, 
which is reminiscent of the mobile element profile of 
teleostean fishes”. 

high levels of repeat insertion are featured. Sex chromosome inventions are 
indicated in red. Branch length is proportional to dS (the synonymous 
substitution rate); dS of each branch is indicated above the line. 


Most reptile genomes contain microchromosomes, but the numbers 
vary among species; the A. carolinensis genome contains 12 pairs of 
microchromosomes”, whereas the chicken genome contains 28 pairs. 
Bird microchromosomes have very distinctive properties compared to 
bird macrochromosomes, such as higher GC and lower repeat contents’, 
whereas lizard microchromosomes do not exhibit these features 
(Fig. 2b). Remarkably, all sequence anchored to microchromosomes 
in A. carolinensis also aligns to microchromosomes in the chicken 
genome, and all but one A. carolinensis microchromosome is syntenic 
to only a single corresponding chicken microchromosome (Fig. 2a). 
Microchromosomes conserved between A. carolinensis and chicken thus 
could have arisen in the reptile ancestor, whereas the remaining chicken 
microchromosomes could be derived in the bird lineage. Alternatively, 
the remaining chicken microchromosomes could have been present in 
the reptile ancestor but fused to form macrochromosomes in the lizard 
lineage. 

The A. carolinensis genome has surprisingly little regional variation 
of GC content, substantially less than previously observed for birds and 
mammals; it is the only amniotic genome known whose nucleotide 
composition is as homogenous as the frog genome® (Supplementary 
Figs 4 and 5). Figure 3 illustrates how local GC content is evolutionarily 
conserved between human chromosome 14 and chicken chromosome 
5, but to a much lesser degree with A. carolinensis chromosome 1. As 
all sequenced amniote genomes other than A. carolinensis contain 
these homologous varying levels of GC content (“‘isochores’)”®, the 
ancestral amniote GC heterogeneity is likely to have eroded towards 
homogeneity in this lizard’s lineage. It has been proposed that iso- 
chores with high GC content are a consequence of higher rates of 
GC-biased gene conversion in regions of higher recombination’. The 
greater GC homogeneity in the anole genome may thus reflect more 
uniform recombination rates, or else a substantially reduced bias 
towards GC during the resolution of gene conversion events in the 
A. carolinensis lineage (for a discussion, see ref. 5). 
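
To make the notion of regional GC variation concrete, the sketch below computes GC percentage in consecutive non-overlapping windows, using the 20-kb window size given in the Figure 3 legend as the default. The function names are hypothetical, and summarizing heterogeneity by the standard deviation across windows is an assumption made for illustration, not the measure used in the analysis.

```python
from statistics import pstdev


def gc_percent_windows(sequence, window=20_000):
    """Return the GC percentage of consecutive non-overlapping windows."""
    sequence = sequence.upper()
    percents = []
    for start in range(0, len(sequence), window):
        chunk = sequence[start:start + window]
        # Count only called bases so that runs of N do not dilute the value.
        called = sum(chunk.count(base) for base in "ACGT")
        if called == 0:
            percents.append(None)  # window consists entirely of uncalled bases
            continue
        gc = chunk.count("G") + chunk.count("C")
        percents.append(100.0 * gc / called)
    return percents


def gc_spread(percents):
    """Standard deviation of windowed GC content (a crude heterogeneity measure)."""
    values = [p for p in percents if p is not None]
    return pstdev(values)


if __name__ == "__main__":
    # Hypothetical toy sequence; a real analysis would read a chromosome from FASTA.
    toy = "GGCCAATT" * 5
    print(gc_percent_windows(toy, window=8))
    print(gc_spread(gc_percent_windows(toy, window=4)))
```

Under this sketch, a genome with isochores would show a larger spread across windows than one with homogeneous composition, provided the same window size is used for both.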

Both temperature-dependent sex determination and XY genetic sex 
determination have been found in Iguania’®. Within the genus Anolis, 
there are species with heteromorphic XY chromosomes (including 
those with multiple X and Y chromosomes), and others with entirely 
homomorphic chromosomes’*. A. carolinensis is known to have 
genetic sex determination”', but the form of its sex chromosomes 
(ZW or XY) has thus far been unknown owing to a lack of obviously 
heteromorphic chromosomes. 

Figure 2 | A. carolinensis-chicken synteny map reveals synteny of reptile 
microchromosomes but dissimilar GC and repeat content. a, Very few 
rearrangements have occurred in the 280 million years since A. carolinensis and 
chicken diverged. A. carolinensis microchromosomes are exclusively syntenic 
to chicken microchromosomes. Horizontal coloured bars depict the six 
A. carolinensis macrochromosomes (1-6) and the six (of 12) A. carolinensis 
microchromosomes that have sequence anchored to them that is syntenic to the 
chicken genome (7, 8, 9, X, LGg, LGh). Chromosomes that could be ordered by 
size were assigned a number; the smaller microchromosomes that could not be 
distinguished by size were assigned a lowercase letter. Each colour corresponds 
to a different chicken chromosome as indicated in the key. Any part of an 
A. carolinensis chromosome that is syntenic to a chicken microchromosome is 
indicated by ‘m’. b, Chicken microchromosomes have both higher GC content 
and lower repeat content than chicken macrochromosomes, whereas 
A. carolinensis chromosomes do not vary in GC or repeat content by 
chromosome size. Large circles designate the GC percentage of each 
chromosome in the chicken and lizard genomes with greater than 100 kb of 
sequence anchored to it. Small circles designate the percentage of the genome 
made up of repetitive sequence of each chromosome in the chicken (blue 
circles) and lizard (red circles) genomes. 

In-depth examination of male and female cells using FISH allowed 
us to identify the microchromosome previously designated as ‘b’ as the 
A. carolinensis X chromosome; it is present in two copies in females 
and one in males. This chromosome is syntenic to chicken 
microchromosome 15. Eleven BACs assigned to two scaffolds, 154 (3.3 Mb) and 
chrUn0090 (1.8 Mb), hybridize via FISH to the p arms of the two X 
chromosomes in females, and hybridize to the p arm of the single X 
chromosome in males (Fig. 4 and Supplementary Fig. 1). A. carolinensis 
thereby shows a pattern representative of a male heterogametic system 
of genotypic sex determination. We have not identified the Y 
chromosome, but we hypothesize that A. carolinensis possesses both X 
and Y chromosomes, as both male and female cells contain the same 
number of chromosomes. 

Figure 3 | The A. carolinensis genome lacks isochores. The A. carolinensis 
genome shows only very local variation in GC content, unlike the human and 
chicken genomes, which also show larger trends in GC variation, sometimes 
called isochores. Syntenic regions of human chromosome 14, chicken 
chromosome 5 and A. carolinensis chromosome 1 are shown. The human and 
chicken regions are inverted and rearranged to align with the A. carolinensis 
region. Blue lines depict GC percentage in 20-kb windows. The purple line 
designates the genome average. Green lines represent examples of syntenic 
anchors between the three genomes. 

The 5.1 Mb of sequence assigned to the X chromosome contains 62 
protein-coding genes (Supplementary Table 10); Gene Ontology (GO) 
terms associated with these genes show no significant enrichment. It is 
very likely that there is more X chromosome sequence that is currently 
labelled as unanchored scaffolds in the AnoCar 2.0 assembly. 
Identification of the A. carolinensis sex determination gene will require 
considerable functional biology, but we note that the chicken sex 
determination gene DMRT1 is located on A. carolinensis chromosome 
2 and that SOX3 (the X chromosome paralogue of the therian mammal 
sex determination gene SRY) is located on an unanchored 
A. carolinensis scaffold; these genes are thus unlikely to be the A. carolinensis 
sex determination gene. 

All ten A. carolinensis individuals (originating from South Carolina 
and Tennessee) used for FISH mapping showed large pericentromeric 
inversions in one or more of chromosomes 1-4, with no correlation 
between different chromosomal inversions or with the sex of the lizard 
(see Supplementary Note, Supplementary Table 11 and Supplemen- 
tary Fig. 6). 

Figure 4 | The A. carolinensis genome contains a newly discovered X 
chromosome. a, b, The X chromosome, a microchromosome, is found in one 
copy in male A. carolinensis (a) and in two copies in females (b). The BAC 
206M13 (CHORI-318 BAC library) is hybridized to the p arm of the X 
chromosome using FISH in both male and female metaphase spreads. 206M13 
and ten other BACs showed this sex-specific pattern in cells derived from five 
male and five female individuals. Original magnification, ×1,000. 

A total of 17,472 protein-coding genes and 2,924 RNA genes were 
predicted from the A. carolinensis genome assembly (Ensembl release 
56, September 2009). We built a phylogeny for all A. carolinensis genes 
and their homologues in eight other vertebrate species (human, 
mouse, dog, opossum, platypus, chicken, zebra finch and pufferfish), 
allowing us to identify a conservative set of 3,994 one-to-one 
orthologues, that is, genes that have not been duplicated or deleted in any of 
these vertebrates since their last common ancestor. These gene 
phylogenies were also used to identify genes that arose by duplication in the 
lizard lineage after the split with the avian lineage and, separately, those 
that were lost in the mammalian lineage after the mammal-reptile split 
(Fig. 1, Supplementary Note, Supplementary Fig. 7 and Supplementary 
Table 12). 
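
The idea of a one-to-one orthologue set can be sketched as a simple filter over gene families. This is a deliberate simplification: the analysis described above infers orthology from gene phylogenies, whereas the sketch only checks copy number per species, and the family identifiers, species labels and function name are hypothetical.

```python
# Species labels are illustrative stand-ins for the nine genomes compared.
SPECIES = [
    "anole", "human", "mouse", "dog", "opossum",
    "platypus", "chicken", "zebra_finch", "pufferfish",
]


def one_to_one_families(families, species=SPECIES):
    """Return the families that have exactly one gene copy in every species."""
    kept = []
    for family_id, members in families.items():
        # A missing species (deletion) or multiple copies (duplication) disqualifies.
        if all(len(members.get(sp, [])) == 1 for sp in species):
            kept.append(family_id)
    return kept


if __name__ == "__main__":
    single_copy = {sp: [sp + "_gene1"] for sp in SPECIES}
    duplicated = dict(single_copy, anole=["anole_gene1", "anole_gene2"])
    families = {"fam1": single_copy, "fam2": duplicated, "fam3": {"human": ["h1"]}}
    print(one_to_one_families(families))
```

Copy counting alone cannot distinguish a true one-to-one orthologue from a case where a duplication was followed by loss of the original copy, which is one reason a phylogeny-based approach is more conservative.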

We found 11 A. carolinensis opsin genes that have no mammalian 
orthologues (but have orthologues in invertebrates, fishes and frog), 
and thus seem to have been lost during mammalian evolution (Sup- 
plementary Table 13). The large repertoire of opsins may contribute to 
the excellent colour vision of anoles—including the ability to see in the 
ultraviolet range—and also may contribute to their hyperdiversity by 
allowing the evolution of diverse, species-specific colouration of the 
dewlap, which has an important role in sexual selection and species 
recognition". Similarly, olfactory receptor and β-keratin genes are 
highly duplicated in A. carolinensis (Supplementary Note and Sup- 
plementary Fig. 9). 

Many reptiles, including green anoles, differ from placental mammals 
in being oviparous (laying eggs). Vivipary in placental mammals is a 
derived state, reflected in their loss of some egg-related genes. We used 
mass spectrometry to identify proteins present in the immature 
A. carolinensis egg, as most egg proteins are produced in the mother’s 
body and then transported into the immature egg. We found that in 
contrast with mammals, reptiles have lineage-specific gene duplications, 
including in vitellogenins (VTGs), apovitellenin-1, ovomucin-α 
and three homologues of ovocalyxin-36, a chicken eggshell matrix 
protein. 

Our results show rapid evolution of egg protein genes among 
amniotes. Specifically, we found proteins from 276 A. carolinensis 
genes in immature A. carolinensis eggs (Supplementary Tables 14 
and 15), of which only 50 have been confirmed to be present in chicken 
eggs by mass spectrometry**”’. These genes include VTGs, a lysozyme, 
vitelline membrane outer layer protein 1 (VMO1) paralogues, protease 
inhibitors, natterin and nothepsin. By aligning genes that are one-to- 
one orthologues in A. carolinensis and chicken, we found that egg 
proteins evolve significantly more rapidly than non-egg proteins 
(mean dN/dS values (ratio of the rate of non-synonymous substitu- 
tions to the rate of synonymous substitutions) of 0.186 and 0.135, 
respectively; P= 1.2 X 10°), which reflects reduced purifying selec- 
tion and/or more frequent episodes of adaptive evolution. 
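
A comparison of mean dN/dS between two gene sets, as reported above, could be sketched as follows. The text does not state which statistical test was used, so the one-sided permutation test here is an assumption chosen for illustration, and the function names and example values are hypothetical.

```python
import random


def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


def permutation_p_value(egg, other, n_permutations=2000, seed=0):
    """One-sided P value for mean(egg) > mean(other) under label shuffling."""
    rng = random.Random(seed)
    observed = mean(egg) - mean(other)
    pooled = list(egg) + list(other)
    n_egg = len(egg)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)
        # Difference in means after randomly reassigning the "egg" label.
        if mean(pooled[:n_egg]) - mean(pooled[n_egg:]) >= observed:
            hits += 1
    # Add one to numerator and denominator so the estimate is never exactly zero.
    return (hits + 1) / (n_permutations + 1)


if __name__ == "__main__":
    # Hypothetical dN/dS values for egg and non-egg one-to-one orthologues.
    egg_dnds = [0.30, 0.25, 0.28, 0.35]
    non_egg_dnds = [0.10, 0.12, 0.08, 0.11]
    print(mean(egg_dnds), mean(non_egg_dnds))
    print(permutation_p_value(egg_dnds, non_egg_dnds))
```

With gene sets of realistic size, the number of permutations limits the smallest P value that can be reported, so a very small P value would require either many more permutations or a parametric or rank-based test.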

Using multiple vertebrate genome sequences, we identified three 
VMO1 paralogues (which we name α, β and γ) that we infer to have 
been present in the last common ancestor of all reptiles and mammals. 
Whereas at least one of VMO1-α, VMO1-β and VMO1-γ has been lost 
in all other amniote genomes, the A. carolinensis genome contains 
representatives of all three paralogues. Moreover, the A. carolinensis-specific 
VMO1-α family has grown to 13 members and has experienced 
positive selection of amino acid substitutions within a negatively 
charged, probably substrate-binding cavity; changes that, presumably, 
modify its lysozyme-like transferase activity (Supplementary Note, 
Supplementary Fig. 8 and Supplementary Tables 16 and 17). 

The extensive and active repeat repertoire of A. carolinensis has 
allowed us to discover the origin of several mammalian conserved ele- 
ments. Through the process of exaptation (a major change in function of 
a sequence during evolution), certain mobile elements that were active 
in the amniote ancestor have become conserved, and presumably func- 
tional, in mammals, while remaining active mobile elements in 
A. carolinensis. The origin of these conserved mammalian sequences 
in mobile elements was not recognizable without comparison to a dis- 
tant and repeat-rich genome sequence™. We identified 96 such exapted 
elements (see Supplementary Table 18) in the human genome tracing 
back to mobile elements present in the amniote ancestor that are still 
present in A. carolinensis, particularly the CR1, L2 and gypsy families. 

Although most exapted elements are non-coding and probably 
serve a regulatory function, we also identified a protein-coding exon 
that was exapted from an L2-like LINE, now constituting exon 2 in a 
mammal-specific N-terminal region of the MIER1 (mesoderm induction 
early response 1) protein. This exon is highly conserved across 29 
mammals and therefore probably represents a mammalian innovation 
since the amniote ancestor. 

GO terms associated with the transcription start site closest to each 
exapted element in the human genome show enrichment for neuro- 
developmental genes (see Methods), with “ephrin receptor binding”, 
“nervous system development” and “synaptic transmission” being 
strongly enriched (all P values<5 X10 *). These enrichments are 
consistent with adaptive changes in neurodevelopment occurring 
during the emergence of mammals. 

Anolis lizards are a textbook case of adaptive radiation, having 
diversified independently on each island in the Greater Antilles and 
throughout the Neotropics, producing a wide variety of ecologically 
and morphologically differentiated species, with as many as 15 found 
at a single locality'’. Although anoles are widely used as a model system 
for phylogenetic comparative studies, it has been difficult to determine 
the evolutionary relationships among major anole clades owing to 
rapid evolutionary radiations associated with access to new dimen- 
sions of ecological opportunity. Successfully resolving the relatively 
short branching events associated with such a radiation requires a 
wealth of data from loci evolving at an appropriate rate. 

Figure 5 | A phylogeny of 93 Anolis species clarifies the biogeographic 
history of anoles. Anolis ecomorphs derive from convergent evolution and not 
from frequent inter-island migration. Using conserved primer pairs distributed 
across the genome of A. carolinensis, we obtain sequences from 46 genomically 
diverse loci evolving at a range of evolutionary rates and representing both 
protein-coding and non-coding regions. Maximum likelihood analyses of this 
new data set of 20 kb aligned nucleotides infer nearly all previously established 
anole relationships while also partially resolving the basal relationships that 
have plagued previous studies. Open circles indicate bootstrap (bs) values < 70; 
grey-shaded circles, 70 < bs < 95; filled circles, bs > 95. 

We used the genome sequence of A. carolinensis to develop a new 
phylogenomic data set comprised of 20 kb of sequence data sampled 
from across the genomes of 93 species of anoles (Supplementary 
Tables 19 and 20). Analyses of this data set infer a well-supported 
phylogeny that reinforces and clarifies the adaptive and biogeographic 
history of anoles (Fig. 5, details in Supplementary Fig. 10). First, our 
phylogenomic analysis reaffirms previous molecular and 
morphological studies indicating that similar anole habitat specialists have 
evolved independently on each of the four large Greater Antillean 
islands. Second, our analyses suggest a complex biogeographic scenario 
involving a limited number of dispersal events between islands and 
extensive in situ diversification within islands. The closest relatives of 
Anolis occur on the mainland and the phylogeny confirms the existence 
of two colonizations, one into the southern Lesser Antilles and the 
second producing the diverse adaptive radiations throughout the rest 
of the Caribbean. Within this latter clade, anoles initially diversified 
primarily on the two larger Greater Antillean islands (although Puerto 
Rico also seems to have been involved) before subsequently undergoing 
secondary radiations on all of the islands and eventually returning to the 
mainland, where this back-colonization has produced an extensive 
evolutionary radiation. The phylogeny also indicates that very few 
inter-island dispersal events occurred in Greater Antillean evolution. 
Rather, the Greater Antillean faunas, renowned for the extent to which 
the same ecomorphs are found on each island, are primarily the result of 
convergent evolution”>. 

The genome sequence of A. carolinensis allows a deeper understand- 
ing of amniote evolution. Filling this important reptilian node with a 
sequenced genome has revealed derived states in each major amniote 
branch and has helped to illuminate the amniote ancestor. However, 
the tree of sequenced reptilian genomes is still extremely sparse, and 
the sequencing of additional non-avian reptiles would be necessary to 
fully understand how typical A. carolinensis and the sequenced bird 
genomes are of the entire reptile clade. 

In addition to the utility of the A. carolinensis genome sequence as a 
representative of non-avian reptiles, Anolis species are a unique 
resource for the study of adaptive radiation and convergent evolution. 
With their invasions of and subsequent radiations on Caribbean 
islands, anoles provide a terrestrial analogue to stickleback and cichlid 
fish, which underwent adaptive evolution in separate aquatic environ- 
ments. Just as genomic research in sticklebacks has deepened the study 
of aquatic ecological speciation, a large-scale genomic phylogenetic 
survey of the Caribbean anoles would be an opportunity for detailed 
study of adaptive evolution in a land animal”; in particular because 
anole genomes contain large numbers of active mobile elements that 
we speculate could form substrates for exaptation of novel regulatory 
elements. 


METHODS SUMMARY 


A full description of methods, including sample collection, sequencing, assembly, 
anchoring, mass spectrometry and all sequence analysis, can be found in Sup- 
plementary Information. All animal experiments were approved by the MIT 
Committee for Animal Care. 

Innate immune recognition of bacterial ligands by 
NAIPs determines inflammasome specificity 


Eric M. Kofoed¹ & Russell E. Vance¹ 


Inflammasomes are a family of cytosolic multiprotein complexes 
that initiate innate immune responses to pathogenic microbes by 
activating the caspase 1 protease’’. Although genetic data support 
a critical role for inflammasomes in immune defence and inflam- 
matory diseases’, the molecular basis by which individual inflam- 
masomes respond to specific stimuli remains poorly understood. 
The inflammasome that contains the NLRC4 (NLR family, CARD 
domain containing 4) protein was previously shown to be activated 
in response to two distinct bacterial proteins, flagellin*’ and PrgJ°, 
a conserved component of pathogen-associated type III secretion 
systems. However, direct binding between NLRC4 and flagellin or 
PrgJ has never been demonstrated. A homologue of NLRC4, 
NAIP5 (NLR family, apoptosis inhibitory protein 5), has been 
implicated in activation of NLRC4 (refs 7-11), but is widely 
assumed to have only an auxiliary role’”, as NAIP5 is often dis- 
pensable for NLRC4 activation”*. However, Naip5 is a member of a 
small multigene family”’, raising the possibility of redundancy and 
functional specialization among Naip genes. Here we show in mice 
that different NAIP paralogues determine the specificity of the 
NLRC4 inflammasome for distinct bacterial ligands. In particular, 
we found that activation of endogenous NLRC4 by bacterial PrgJ 
requires NAIP2, a previously uncharacterized member of the 
NAIP gene family, whereas NAIP5 and NAIP6 activate NLRC4 
specifically in response to bacterial flagellin. We dissected the 
biochemical mechanism underlying the requirement for NAIP 
proteins by use of a reconstituted NLRC4 inflammasome system. 
We found that NAIP proteins control ligand-dependent oligomer- 
ization of NLRC4 and that the NAIP2-NLRC4 complex physically 
associates with PrgJ but not flagellin, whereas NAIP5-NLRC4 
associates with flagellin but not PrgJ. Our results identify NAIPs 
as immune sensor proteins and provide biochemical evidence for a 
simple receptor-ligand model for activation of the NAIP-NLRC4 
inflammasomes. 

A fundamental question in immunology is how host defence is 
initiated in response to specific microbial ligands. The inflammasome 
containing the NLRC4 protein activates caspase 1 (CASP1) in response 
to the carboxy terminus of bacterial flagellin, as well as in response to
the inner rod protein of the type III secretion systems of diverse
bacterial species (for example, PrgJ of Salmonella enterica serovar
Typhimurium). Activated CASP1 processes interleukin (IL)-1β and
IL-18 inflammatory cytokines and induces a rapid and inflammatory
host cell death called pyroptosis. In certain cases, NLRC4 activation
requires NAIP5, as Naip5−/− mice fail to activate NLRC4 or CASP1 in
response to infection with Legionella pneumophila or in response to the
C terminus of flagellin. Interestingly, however, NAIP5 is not essential
for NLRC4 activation in response to S. enterica Typhimurium or
PrgJ.

In addition to Naip5, C57BL/6 mice express three other Naip genes 
(Naip1, Naip2 and Naip6), the functions of which remain unknown.
We hypothesized that each NAIP paralogue may have evolved to be
specific for a unique bacterial ligand. We first focused on NAIP2, as it
appeared to be highly expressed in C57BL/6 mice. We used specific


short hairpin RNAs (shRNAs) to knock down Naip2 expression in 
primary bone-marrow-derived macrophages. ShRNA1 and shRNA2 
specifically reduced NAIP2 protein levels without targeting other 
NAIP paralogues, whereas empty vector, shRNA3 or a scrambled 
control shRNA had little effect on NAIP2 protein levels (Supplementary
Fig. 1a, b). Macrophages expressing these shRNAs were then
infected with flagellin-deficient Listeria strains that inducibly express
PrgJ (Listeria-PrgJ) or flagellin (Listeria-FlaA). A Listeria-based system
was chosen because it is an efficient means for delivering PrgJ to macrophages,
and because it allows for controlled comparisons of PrgJ and
FlaA within a single experimental system. Notably, knockdown of Naip2
prevented pyroptosis and CASP1 activation by Listeria-PrgJ (Fig. 1a-c).
By contrast, Naip2 knockdown did not affect inflammasome activation
by Listeria-FlaA (Fig. 1b, c) or L. pneumophila, which expresses flagellin
but not PrgJ (Supplementary Fig. 1c). Instead, flagellin-dependent
inflammasome activation depended on Naip5, as previously shown.
Inflammasome activation by wild-type Salmonella, which encodes both
flagellin and PrgJ, was not significantly affected by Naip2 knockdown
(Fig. 1d, e). However, knockdown of Naip2 in Naip5−/− macrophages
significantly reduced or abolished inflammasome activation by wild-type
Salmonella (Fig. 1d, e), indicating that both NAIP2 and NAIP5
recognize Salmonella. Interestingly, inflammasome activation by
flagellin-deficient (FliC− FljB−) Salmonella, which still express PrgJ,
depended entirely on Naip2 (Fig. 1d, e). Taken together, these data
indicate that Naip2 is specifically required for activation of the NLRC4
inflammasome by PrgJ, in contrast to Naip5, which seems to be specifically
required for NLRC4 activation by flagellin.

Biochemical analysis of the inflammasome in macrophages is com- 
plicated by the expression of multiple NAIP proteins and by their low 
expression levels. We therefore decided to reconstitute the NLRC4 
inflammasome in non-immune 293T cells, which do not express 
NLRC4 or NAIPs, so that the functions of individual NAIP proteins 
could be analysed. 293T cells transiently transfected with green fluor- 
escent protein (GFP)-marked vectors encoding wild-type NLRC4, 
NAIP5 and CASP1 did not exhibit significant spontaneous inflamma- 
some activation, and instead, most cells expressed GFP (Fig. 2a). 
However, when flagellin (FlaA) from L. pneumophila was co-expressed 
with NLRC4, NAIP5 and CASP1, we observed a significant loss of 
GFP+ cells and an increase in the number of dead (7AAD+) cells
(Fig. 2a). This result was highly reminiscent of flagellin-dependent
activation of the endogenous NAIP5-NLRC4 inflammasome in macrophages,
which also results in a rapid CASP1-dependent cell death, loss
of membrane integrity, and release of cytosolic contents and GFP.
Similar to the genetic requirement for Nlrc4, Naip5 and Casp1 in macrophages,
we found that NAIP5, NLRC4, catalytically active
CASP1, and FlaA are all required to trigger cell death and loss of membrane
integrity/GFP in reconstituted 293T cells (Fig. 2b, c). The reconstituted
NAIP5-NLRC4 inflammasome also recapitulated the ability of
native inflammasomes to process CASP1 and IL-1β in response to
cytosolic flagellin (Supplementary Fig. 2). Consistent with a lack of a
role for NAIP5 in recognition of PrgJ by macrophages, the reconstituted
NAIP5-NLRC4 inflammasome did not respond to PrgJ (Fig. 2d, e). By
contrast, a reconstituted NAIP2-NLRC4 inflammasome responded
specifically to PrgJ but not flagellin (Fig. 2d, e). Taken together, we
conclude that we have successfully reconstituted NAIP2-NLRC4 and
NAIP5-NLRC4 inflammasomes that exhibit all the known requirements
and specificities of the native inflammasomes.

Figure 1 | NAIP2 is required in macrophages for inflammasome activation
in response to PrgJ. a-c, Primary bone-marrow-derived macrophages
expressing shRNAs targeting Naip2 (N2) (or controls) were infected with
flagellin-deficient L. monocytogenes (multiplicity of infection = 5) expressing a
secreted ActA100-PrgJ (pPrgJ) or ActA100-FlaA (pFlaA) fusion protein under
IPTG-inducible control. Cell death (±s.d.) was measured in triplicate by LDH
release 6 h after infection (a, b), or active CASP1 (p10) was measured by
western blotting of cell supernatants (c). d, e, NAIP2 knockdown cells were
infected with wild-type or flagellin-deficient (FliC− FljB−) S. enterica
Typhimurium and inflammasome activation was measured by LDH release
(±s.d.) at 3 h after infection (d) or CASP1 processing (e). Data shown are
representative of two (c, e) or three (a, b, d) independent experiments.
*P < 0.02 as compared to scramble (Student’s t-test, two-tailed).

Figure 2 | Reconstitution of the NAIP5-NLRC4 inflammasome in 293T
cells. a, GFP-marked expression vectors encoding NLRC4, NAIP5, CASP1
and/or flagellin (FlaA) were transiently transfected into 293T cells. Cells were
imaged for differential interference contrast (DIC) and GFP fluorescence 48 h
later. Dead cells were stained with 7AAD. b, c, GFP+ cells (b) and 7AAD+
cells (c) were quantified (±s.d.) as in a, but with specific expression vectors
omitted from the transfection as indicated. CASP1(C284A) is a catalytically
dead mutant. d, e, 293T cells were transfected as indicated and analysed as
above. Data shown (±s.d.) are representative of at least three independent
experiments. *P < 0.02 (Student’s t-test, two tailed); NS, not significant.

It is believed that activated inflammasomes assemble into high-
molecular-mass multiprotein complexes, but this has not been
demonstrated for the NLRC4 inflammasome. To visualize inflammasome
assembly, 293T cells were transfected with NAIP5, NLRC4 and
FlaA in various combinations, but CASP1 was omitted so that cell
death and loss of cellular contents (and assembled inflammasomes)
would not occur. Digitonin-solubilized cell lysates were resolved on
blue native (BN) polyacrylamide gel electrophoresis (PAGE) gels. A
marked shift of NLRC4 from a monomer (~120 kDa) to an oligomeric
complex (~1,000 kDa) was seen in the presence of NAIP5 and FlaA.
NAIP5 was also contained within the high-molecular-mass oligomeric
complex (Fig. 3a). The association of NAIP5 and NLRC4 in the same
complex was validated by co-immunoprecipitation (Supplementary
Fig. 3). NLRC4 oligomerization was induced by either untagged
FlaA or a GFP-FlaA fusion protein (Fig. 3a), both of which activate 
NLRC4-CASP1. Importantly, assembly of the NLRC4 inflammasome 
required FlaA (Supplementary Fig. 4a) and was not observed in the 
absence of NAIP5 (Fig. 3a), indicating that a biochemical function of 
NAIP5 is to promote NLRC4 oligomerization. 

Despite strong genetic evidence that NLR proteins, such as NAIP5 
and NLRC4, function as microbial ‘sensors’, there is no biochemical 
evidence that NLRs interact directly with microbial ligands. In fact, 
some studies of the NLRP3 inflammasome, as well as analyses of
analogous proteins from plants, suggest that at least some NLRs
recognize pathogens indirectly. To determine if the oligomerized
NAIP5-NLRC4 complex also contains flagellin, we subjected samples
separated in the first dimension by native PAGE to a second dimension
of SDS-PAGE. To facilitate detection of flagellin, we used a 6X-Myc-tagged
flagellin, which activates the inflammasome identically to native
flagellin (data not shown). This approach revealed that FlaA was indeed 
present in a high-molecular-mass complex, along with NAIP5 and 
NLRC4 (Fig. 3b). NAIP5 exhibited a weak flagellin-dependent mobility 
shift in the absence of NLRC4 (Supplementary Fig. 4b), indicating that 
NLRC4 is not essential for flagellin recognition, although formation/ 
stabilization of the oligomerized complex seems to be significantly 
enhanced by NLRC4. FlaA expressed alone was present in cell extracts 


29 SEPTEMBER 2011 | VOL 477 | NATURE | 593 


©2011 Macmillan Publishers Limited. All rights reserved 



Figure 3 | NAIP5 is required for formation of a hetero-oligomeric complex 
that contains NLRC4, NAIP5 and flagellin. a, 293T cells were transfected as 
indicated, followed by analysis by blue native PAGE or SDS-PAGE, and 
western blotting. *NS, nonspecific band. b, 293T cells were transfected as 
indicated and lysates were separated by a first dimension of blue native PAGE 
followed by a second dimension of SDS-PAGE. c, d, 293T cells were transfected
as indicated and samples were processed and analysed as in a. Data shown are 
representative of at least three independent experiments. 


only as a monomer (Supplementary Fig. 4c). Taken together, these 
observations provide evidence for a simple receptor-ligand model of 
NAIP5-NLRC4 activation by flagellin. 

Consistent with the autoinhibitory function of the leucine-rich
repeats (LRRs) in other NLRs, we found that NAIP5(ΔLRR) and
NLRC4(ΔLRR) constitutively activated CASP1-dependent cell death,
independent of the presence of flagellin (Supplementary Fig. 5).
Interestingly, NLRC4(ΔLRR) was able to activate CASP1 in the
absence of NAIP5, whereas constitutively active NAIP5(ΔLRR)
required wild-type NLRC4 to activate CASP1. This result suggests that
NAIP5 functions upstream of NLRC4. Indeed, NAIP5(ΔLRR) was able
to induce the oligomerization of wild-type NLRC4 (Fig. 3c), whereas
the spontaneous oligomerization of NLRC4(ΔLRR) did not require
NAIP5 (Fig. 3d). Spontaneous oligomerization of NLRC4(ΔLRR)
did require the nucleotide-binding domain (NBD) of NLRC4, as a
K175R mutation previously shown to disrupt NBD function
abolished NLRC4(ΔLRR) auto-oligomerization (Fig. 3d). The ability
of NAIP5 to induce oligomerization of NLRC4 in response to flagellin
required both the NBD and amino-terminal BIRs of NAIP5, but did
not require the N-terminal CARD of NLRC4 (Supplementary Fig. 4d,
e), whereas functional CASP1 activation required all of these domains
(Supplementary Figs 5 and 6). Taken together, these data are indicative
of a working model (Supplementary Fig. 7) in which NAIP5 is
activated by flagellin and induces downstream NLRC4 oligomerization
and CASP1 activation.

Consistent with a specific role for NAIP2 in recognition of PrgJ, we 
found that PrgJ did not induce the oligomerization of NAIP5-NLRC4 
(Fig. 4a) but did induce oligomerization of NAIP2-NLRC4. Oligo- 
merization of NLRC4 did not occur when co-expressed with NAIP2 
alone or with NAIP2 and FlaA (Fig. 4b). Interestingly, NAIP6 resembled 
NAIP5 and supported NLRC4 oligomerization in response to FlaA but 
Figure 4 | NAIP paralogues confer specificity to the NLRC4 inflammasome.
a, 293T cells were co-transfected with wild-type NAIP5 and NLRC4, alone or in
combination with 6X-Myc-FlaA or 6X-Myc-PrgJ followed by blue native
PAGE 48 h later. *NS, nonspecific band. Whole-cell lysates were also separated
by conventional 4-12% SDS-PAGE to control for expression of each 
transfected gene construct (left panel). b, 293T cells were transfected with wild- 
type NAIP2 and NLRC4 and analysed as in a. c, 293T cells were transfected with 
wild-type NAIP6 and NLRC4, and analysed as in a. Data shown are 
representative of at least three independent experiments. 


not PrgJ (Fig. 4c), perhaps providing an explanation for the previously 
puzzling observation that Naip5−/− cells can respond to high levels of
flagellin. In contrast, NAIP1 is an ‘orphan’ NAIP because it responded
neither to PrgJ nor flagellin (Fig. 2d, e and Supplementary Fig. 8). 

Our results demonstrate that the ability of the NLRC4 inflamma- 
some to assemble and functionally activate CASP1 in response to spe- 
cific bacterial ligands is dictated by NAIP family members. The most 
parsimonious model to account for our results is that NAIP proteins 
function as direct receptors for bacterial ligands (Supplementary Fig. 7). 
Although NLRC4 was previously suspected to be the cytosolic flagellin 
sensor, we hypothesize that a main function of NLRC4 may instead
be to serve as an adaptor, downstream of NAIP proteins, to recruit 
CASP1 via a CARD-CARD interaction. NLRC4 may also have an 
important function in ligand binding or in stabilizing NAIP- 
NLRC4-ligand complexes, but the specificity of the complexes for 
particular ligands seems to be controlled by NAIP proteins. 

The number and sequence of Naip paralogues vary significantly 
among inbred mouse strains, and have been suggested to be evolving 
rapidly. Indeed, the murine Naip locus was originally identified by a
forward genetic approach which took advantage of the widely varying
susceptibility of inbred mouse strains to L. pneumophila infection.
The single known human NAIP orthologue may also exist within a
rapidly evolving locus; our results indicate that it will be of great
interest to establish the specificity of the human NAIP protein. We
propose that Naip gene evolution represents a fascinating example of
the molecular arms race between bacteria and their hosts. 


METHODS SUMMARY 

Naip2 knockdown. Primary C57BL/6 bone marrow cells were transduced with 
pLKO.1-based lentivirus encoding shRNAs that specifically target Naip2 or con- 
trols. Bone marrow cells were differentiated into macrophages by culture in media 
containing macrophage colony-stimulating factor (MCSF). On day 4 of culture, 
transduced macrophages were selected by addition of puromycin (5 µg ml⁻¹). On
day 8 of culture, macrophages were re-plated and infected the next day with 
Listeria monocytogenes or S. enterica Typhimurium expressing flagellin or PrgJ,
and inflammasome activation was measured by assaying release of the cytosolic 
enzyme lactate dehydrogenase (LDH) or by western blotting for processed (p10) 
CASP1. 

Reconstituted inflammasome. The inflammasome was reconstituted by trans- 
fection of 293T cells with MSCV2.2-IRES-GFP-based expression vectors encoding 
various mouse (C57BL/6-derived) Naip genes, Nlrc4 and caspase 1. Inflammasome
oligomerization was assessed in digitonin (1%) lysates using a Bis-Tris
NativePAGE system (Invitrogen) followed by western blotting. 

Statistical analysis. Statistical differences were calculated with an unpaired two- 
tailed Student’s t-test using GraphPad Prism 5.0b. 
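
The unpaired two-tailed Student's t-test named above can be sketched in a few lines. The following is a minimal illustration, not the GraphPad Prism implementation: it assumes the classical equal-variance (pooled) form of the test, obtains the P value by numerical integration of the t density, and uses made-up triplicate values (the function names `t_pdf`, `two_tailed_p` and `unpaired_t_test` are ours).

```python
import math
from statistics import mean, variance


def t_pdf(x, df):
    """Probability density of Student's t distribution with df degrees of freedom."""
    norm = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return norm * (1 + x * x / df) ** (-(df + 1) / 2)


def two_tailed_p(t, df, steps=20000):
    """Two-tailed P value, integrating the density from 0 to |t| by Simpson's rule."""
    upper = abs(t)
    h = upper / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return max(0.0, 1.0 - 2.0 * total * h / 3.0)


def unpaired_t_test(a, b):
    """Equal-variance (pooled) unpaired Student's t-test; returns (t, df, p)."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / math.sqrt(pooled_var * (1 / na + 1 / nb))
    return t, df, two_tailed_p(t, df)


# Hypothetical triplicate LDH-release values (% cell death), not data from the paper.
scramble = [62.0, 58.0, 60.0]
knockdown = [12.0, 15.0, 9.0]
t, df, p = unpaired_t_test(scramble, knockdown)
print(f"t = {t:.2f}, df = {df}, P = {p:.2g}")
```

With triplicates in each group the test has only four degrees of freedom, so a threshold such as P < 0.02 requires a large difference relative to the within-group scatter.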


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Mice. C57BL/6J (B6) mice were purchased from Jackson Labs and bred at UC 
Berkeley. Naip5−/− mice on a pure B6 background were described previously.
Animal experiments were approved by the UC Berkeley Animal Care and Use 
Committee. 

Cell culture. HEK293T cells were grown in complete media (DMEM, 10% FBS, 
100 units ml⁻¹ penicillin, 100 µg ml⁻¹ streptomycin, 2 mM L-glutamine). Bone
marrow macrophages were differentiated from bone-marrow-derived precursor
cells using macrophage colony stimulating factor as described previously.
Transient transfections. HEK293T cells were plated at 8 X 10° cells per well in 
6-well plates, transfected the next day using Lipofectamine 2000, and collected for 
flow cytometric analysis 48 h later. 

Measurement of cell death. Cell death of HEK293T cells was measured by flow 
cytometry measuring GFP and 7AAD fluorescence. Cells were stained with 7AAD 
(BD Pharmingen) according to the manufacturer’s instructions. Death of immor- 
talized macrophages in response to L. pneumophila and LFn-FlaA was measured by 
following release of the intracellular enzyme lactate dehydrogenase (LDH) as 
described previously. Infection of immortalized macrophages with L. pneumophila
was performed using a multiplicity of infection (MOI) of 2, and cell death was
measured 4 h later as described previously. LFn-FlaA is a recombinant 6X His-tagged
fusion protein encoding the first (non-enzymatically active) 263 amino acids
of lethal factor from Bacillus anthracis fused to full-length flagellin (FlaA) from
L. pneumophila (J. von Moltke and R.E.V., unpublished). LFn-FlaA and B. anthracis
protective antigen (PA) were purified from E. coli as described previously.
Endotoxin was removed from these proteins using Detoxi-Gel (Pierce) according 
to the manufacturer’s protocol. 
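
The text reports cell death from LDH release as a percentage with a standard deviation but does not spell out the calculation. A minimal sketch under a common convention (an assumption here, not a formula stated in the paper) normalizes each reading between a spontaneous-release control and a maximal-release control; the function names and absorbance readings below are hypothetical.

```python
from statistics import mean, stdev


def percent_ldh_release(sample, spontaneous, maximum):
    """Percentage of maximal LDH release after subtracting spontaneous release."""
    return 100.0 * (sample - spontaneous) / (maximum - spontaneous)


def summarize_replicates(samples, spontaneous, maximum):
    """Mean and sample standard deviation of percent release across replicate wells."""
    values = [percent_ldh_release(s, spontaneous, maximum) for s in samples]
    return mean(values), stdev(values)


# Hypothetical absorbance readings for one condition measured in triplicate.
mean_pct, sd_pct = summarize_replicates([0.50, 0.60, 0.70], spontaneous=0.20, maximum=1.20)
print(f"cell death = {mean_pct:.1f} +/- {sd_pct:.1f} %")
```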

Western blotting and native PAGE. Blue native gel electrophoresis was per- 
formed using the Bis-Tris NativePAGE system by Invitrogen according to the 
manufacturer’s instructions. Briefly, HEK293T cells were transfected for 48h, 
washed twice with cold PBS, trypsinized for 3 min at 37°C, re-suspended in 
complete media, and pelleted by centrifugation at 400g for 5 min at 4°C. Cell 
pellets were washed twice with cold PBS, followed by re-suspension in 1% digito- 
nin native lysis buffer (50 mM BisTris, 50 mM NaCl, 10% w/v glycerol, 0.001% 
Ponceau S, 1% digitonin, 2 mM Na3VO4, 1 mM PMSF, 25 mM NaF, 1X Roche
protease inhibitor cocktail (no EDTA), pH 7.2). Cell lysates were triturated and 
incubated on ice for 30 min, and then insoluble cell debris was pelleted by cent- 
rifugation at 16,000g for 30 min at 4 °C. Lysates were quantified for total protein 
using the BCA protein assay (Pierce), equalized for total protein, and separated by 
NativePAGE using the Novex BisTris gel system according to the manufacturer’s 
instructions (Invitrogen). Native gels were soaked in 10% SDS for 5 min before 
transfer to PVDF membrane (Millipore) and conventional western blotting. 
Antibodies used were anti-mIL-1β (R&D systems), anti-caspase 1 p10 (M20)
(Santa Cruz), anti-NLRC4 (gift of S. Mariathasan and V. Dixit, Genentech),
anti-NAIP5(961-978), anti-Myc (9E10) (Clontech), anti-β-actin (C4) (Santa
Cruz), anti-GFP (JL-8) (Clontech), anti-NAIP2(33-46), anti-rabbit IgG-HRP
and anti-mouse IgG-HRP (GE Healthcare), anti-goat IgG-HRP (Santa Cruz). In
some cases, native gels were subjected to a second dimension of electrophoresis 
using SDS-PAGE. A 5.7-cm slice (lane) of natively resolved gel was placed in a 
dish containing 1X Laemmli sample buffer (50 mM Tris-Cl (pH 6.8), 100 mM
DTT, 2% (w/v) SDS, 0.1% bromophenol blue, 10% (v/v) glycerol) for 10 min, 
microwaved on high for 20s, and rocked for another 5 min before loading slice 
into the well of a precast 2D 4-12% SDS-PAGE gel (Invitrogen). For immunoprecipitations,
digitonin cell extracts (100 µg total protein) were pre-cleared with
25 µl of washed Protein-G-sepharose (GE Healthcare), and then cleared extracts
were incubated with 1 µg primary antibody (or isotype controls) overnight at 4 °C,
and complexes were captured the following day with 25 µl of washed Protein-G-
sepharose. Bound proteins were eluted by re-suspension in Laemmli sample buf- 
fer, boiled for 5 min and separated by SDS-PAGE. 

Expression constructs. NAIP5 wild-type and mutant constructs were cloned into 
the MSCV2.2 retroviral vector containing an IRES-GFP downstream of the mul- 
tiple cloning site. Expression is driven from the viral LTR and is considerably lower 
than that driven by the CMV promoter (data not shown). In general, wild-type and 
mutant ORFs were amplified between flanking BamHI and NotI sites, and a Kozak 
sequence (GCCACC) was engineered to precede the start codon. The BamHI/NotI
digested PCR product was cloned into complementary BglII/NotI digested
MSCV2.2 vector. Wild-type NAIP5(1-1402) and NAIP5(ΔPloop) (Δ464-487)
were amplified from pcDNA3 using forward (AAAAGGATCCGCCACCATGGC 
TGAGCATGGGGAGTCCTCCG) and reverse (AAAAGCGGCCGCTTACTCC 
AGGATAACAGGAGAGAATGGGAC) primers. NAIP5(Δ347) (347-1402) was
generated using the forward primer (AAAAGGATCCGCCACCATGACC 
TTGAAGTCCTCTGCAGAAG) in combination with wild-type NAIP5 reverse 
primer. NAIP5(ΔLRR) (1-1039) was PCR cloned into BglII/PmeI sites of
MSCV2.2 vector using the wild-type NAIP5 forward primer (see above), and 


reverse primer (GTTTAAACTCAGCCACTGCTGTTGAATAAACG). Wild-
type NLRC4 (1-1024) was PCR amplified from pCDNA3 using forward
(AAAAGGATCCGCCACCATGAACTTTATAAGGAACAACAGACG) and reverse 
(TTTTGCGGCCGCTTAAGCAGTCACTAGTTTAAAGGTGCC) primers, and 
ligated into BglII/NotI sites of MSCV2.2-IRES-GFP. NLRC4(K175R) was generated
by site-directed mutagenesis using the QuickChange protocol (Stratagene) using 
forward (GAGTCTGGCAAAGGGCGATCGACCCTGCTGCAG) and reverse 
(CTGCAGCAGGGTCGATCGCCCTTTGCCAGACTC) primers. NLRC4(ΔCARD)
(89-1024) was PCR amplified from pCDNA3 using forward primer (AAAA 
GGATCCGCCACCATGTCTTATCAGGTCACAGAAGAAGACC) paired with 
the wild-type reverse primer. NLRC4(ΔLRR) (1-656) was PCR amplified from
pCDNA3 using the reverse primer (TTTTGCGGCCGCTTACTTCCAGTTG 
AAGAACAAAGACACAGC) in combination with the wild-type forward primer. 
Wild-type NAIP2 was PCR cloned from pBluescript(SK-) into the BglII/NotI sites
of MSCV2.2-IRES-GFP using forward (AAAAGGATCCGCCACCATGGCAGC 
CCAGGGAGAAGCCGTTGAGG) and reverse (TTTTGCGGCCGCTCACTTC 
TGAATGACAGGAGAGAATGGCACTACCC) primers. Wild-type NAIP6 was 
PCR cloned from pBluescript(SK-) into the BglII/NotI sites of MSCV2.2-IRES-GFP
using the wild-type NAIP5 forward primer (there is 100% conservation of the first 
26 nucleotides among NAIP1, NAIP5 and NAIP6) and the reverse primer 
(TTTTGCGGCCGCTTACTCCAGGACAACAGGAGAGAACGGGAC). Wild-
type NAIP1 was PCR cloned from pBluescript(SK-) into the BglII/NotI sites of
MSCV2.2-IRES-GFP using the wild-type NAIP5 forward primer, and the wild-type 
NAIP6 reverse primer (NAIP1 and NAIP6 share identical C-terminal nucleotides). 
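
The primer design described above (restriction site, Kozak sequence, start codon; BamHI-cut insert into BglII-cut vector; complementary mutagenesis primers) can be checked mechanically. The sketch below is illustrative only: the primer sequences are copied from this section, the recognition sites and cut positions are the standard ones for BamHI (G^GATCC), BglII (A^GATCT) and NotI (GC^GGCCGC), and the helper names are ours.

```python
def reverse_complement(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


SITES = {"BamHI": "GGATCC", "BglII": "AGATCT", "NotI": "GCGGCCGC"}
KOZAK = "GCCACC"

# Primers as listed in the text for wild-type NAIP5 and NLRC4(K175R).
NAIP5_FWD = "AAAAGGATCCGCCACCATGGCTGAGCATGGGGAGTCCTCCG"
NAIP5_REV = "AAAAGCGGCCGCTTACTCCAGGATAACAGGAGAGAATGGGAC"
K175R_FWD = "GAGTCTGGCAAAGGGCGATCGACCCTGCTGCAG"
K175R_REV = "CTGCAGCAGGGTCGATCGCCCTTTGCCAGACTC"


def describe_forward_primer(primer):
    """Report whether a BamHI site is present and is followed by Kozak + ATG."""
    site_pos = primer.find(SITES["BamHI"])
    kozak_pos = primer.find(KOZAK + "ATG")
    return {
        "has_bamhi": site_pos != -1,
        "kozak_before_atg": kozak_pos != -1 and kozak_pos > site_pos,
    }


def sticky_overhang(site, cut_offset):
    """5' overhang left when a palindromic site is cut cut_offset bases into each strand."""
    return site[cut_offset:len(site) - cut_offset]


print(describe_forward_primer(NAIP5_FWD))
print(sticky_overhang(SITES["BamHI"], 1), sticky_overhang(SITES["BglII"], 1))
print(reverse_complement(K175R_FWD) == K175R_REV)
```

BamHI and BglII both leave a GATC overhang, which is why a BamHI-digested PCR product can be ligated into a BglII-digested vector even though the two recognition sites differ.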

MSCV2.2-caspase 1 and MSCV2.2-caspase 1(C284A) have been described
previously.

We modified the MSCV2.2 vector to contain a 6X-Myc-tag in the MCS to
facilitate generation of NH2-terminal fusion proteins (6X-Myc-MSCV2.2-IRES-
GFP). The 6X-Myc-tag from pCS-6X-Myc-SEC24 was PCR amplified using
forward (AAAAAGATCTATCGATTTAAAGCTATG) and reverse (TTTTGC 
GGCCGCTGG-CCGGCCTGAATTCA) primers for insertion into the BglII site 
of MSCV2.2. 6X-Myc-FlaA was generated by PCR amplifying full-length FlaA 
(L. pneumophila) from MSCV2.2-FlaA using forward (AAAAGCGGCCGCAG
CTCAAGTAATCAACACTAATGTGGC) and reverse (TTTTGTCGACTATC- 
GACCTAACAAAGATAATAC) primers and inserted into NotI/SalI sites.
6X-Myc-PrgJ was generated by amplifying PrgJ from MSCV2.2-PrgJ using 
forward (AAAAGCGGCCGCATCGATTGCAACTATTGTCCC) and reverse 
(TTTTGTCGACTCATGAGCGTAATAGCGTTTC) primers and insertion into 
NotI/SalI sites. All constructs were fully sequenced to confirm their identity.
Sequencing primers used for NAIP5: MSCV2.2-F, AAGCCCTTTGTA 
CACCCTAAGCC, MSCV2.2-R, CCTCACATTGCCAAAAGAC; NAIP5seq#1, 
CAGCAAAAGCACTGAACGCC; NAIP5seq#2, ATGAACAAATCCCTCGTAGC; 
NAIP5seq#3, TCACTCCTACCCAAGTCCAC; NAIP5seq#4, CICAGACACA- 
CTTCACTAATGC; NAIP5seq#5, TCCCTTAGTTCCATCACACC; NAIP5seq#6, 
GACCCCTCTCTTTGTAGCAG; NAIP5seq#7, GAGTTTCTTGCTGCCGTGAG; 
NAIP5seq#8, TTAGAGGGTTGT-GGCTGGTGTC; NAIP5seq#9, CTTCACA 
GAGTATTGAGTTCCG; NAIP5seq#10, TTGAGTTTTCTGGACGATGC;
NAIP5seq#11, GGACAACTTGCCAAACCTAC. Sequencing primers for NAIP2: 
NAIP2seqF1, TG@GTGATGAGAAAGAGTCAC; NAIP2seqF2, CTTCACAGAGT 
ATTGAGTTCCG; NAIP2seqR1, AGCAAATGGTCAGTGCCGAG; NAIP2seqR2, 
ACATACTGCTGCCACGAAG; NAIP2seqR3, AATCCAGTGTTCTCCCTCG; 
MSCV2.2-F and -R primers (see above). Sequencing primers for NAIP6: 
NAIP6seqF1, CAGAAAGCCTGTTACTGTTGAG; NAIP6seqR1, GATGGAACT 
AAGGGAGAGGTAG; NAIP6seqR2, TCTTGGTCTTCCTGCCTATC; MSCV2.2-F, 
MSCV2.2-R, NAIP5seq#7 (see above). 

Generation of Naip2 shRNA constructs. We used the lentiviral pLKO.1-TRC 
cloning vector (Addgene) to generate three vectors expressing three separate 
shRNAs targeting Naip2 (Naip2 shRNAs 1-3). Naip2 shRNA1 oligos, CCGGG 
CCATTGCCTTTCAACCTATACTCGAGTATAGGTTGAAAGGCAATGGCT 
TTTTG and AATTCAAAAAGCCATTGCCTTTCAACCTATACTCGAGTAT 
AGGTTGAAAGGCAATGGC. Naip2 shRNA2 oligos, CCGGCCATCCAGAAA 
CCTTGTTGTTCTCGAGAACAACAAGGTTTCTGGATGGTTTTTG and AA 
TTCAAAAACCATCCAGAAACCTTGTTGTTCTCGAGAACAACAAGGTTT 
CTGGATGG. Naip2 shRNA3 oligos, CCGGCTTTCAGTCTTGAAGAGACAA 
CTCGAGTTGTCTCTTCAAGACTGAAAGTTTTTG and AATTCAAAAACT 
TTCAGTCTTGAAGAGACAACTCGAGTTGTCTCTTCAAGACTGAAAG. We 
included pLKO.1-TRC control vector (Addgene ID#10879) or pLKO.1 scramble 
(Addgene ID#1864) as negative controls. 
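
The shRNA oligos listed above share one layout: a 21-nucleotide sense sequence, a CTCGAG loop, the reverse complement of the sense sequence, and fixed flanking bases. The sketch below rebuilds the oligo pair from the 21-mer alone; reading the flanks as AgeI- and EcoRI-compatible overhangs follows the usual pLKO.1 cloning scheme and is our interpretation, and the function name `plko1_oligos` is hypothetical.

```python
def reverse_complement(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def plko1_oligos(sense_21mer):
    """Build forward and reverse hairpin oligos from a 21-nt sense target sequence."""
    hairpin = sense_21mer + "CTCGAG" + reverse_complement(sense_21mer)
    forward = "CCGG" + hairpin + "TTTTTG"   # sticky end plus Pol III terminator
    reverse = "AATTCAAAAA" + hairpin        # hairpin is its own reverse complement
    return forward, reverse


# 21-nt sense sequences read off the three Naip2 shRNA oligos in the text.
TARGETS = {
    "shRNA1": "GCCATTGCCTTTCAACCTATA",
    "shRNA2": "CCATCCAGAAACCTTGTTGTT",
    "shRNA3": "CTTTCAGTCTTGAAGAGACAA",
}

for name, target in TARGETS.items():
    fwd, rev = plko1_oligos(target)
    print(name, fwd, rev, sep="\n  ")
```

Because the loop is palindromic and the stem arms are reverse complements of each other, the hairpin insert reads the same on both strands, which is why the forward and reverse oligos differ only in their flanking bases.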

Knockdown of NAIP2 in primary macrophages. Bone marrow was collected 
from C57BL/6 mice on day 0, and plated into one 15-cm plate in macrophage 
differentiation media (see above). Lentivirus encoding Naip2 shRNAs were generated 
according to the Addgene protocol. On day 2, primary bone marrow macrophages 
were collected, red blood cells were lysed, and cells were plated at 1 X 10° cells per 
well in 6-well plates. Macrophages were spinfected with lentiviral particles at 32 °C,
90 min, 1,258g, and placed in a 32 °C incubator for 48 h. On day 4, cells were collected
in cold PBS, re-plated on 10-cm plates containing fresh media containing puromycin
(5 µg ml⁻¹) for selection, and placed at 37 °C. Puromycin containing media was
replaced on day 6. On day 8, macrophages were collected, counted and seeded at 
1 X 10° cells per well in a 96-well plate, and infected the following day. In some 
experiments, NAIP2 was knocked down in v-myc/v-raf immortalized bone-
marrow-derived macrophages that were previously generated by use of the J2 virus.
Listeria infections. Flagellin-deficient strains of Listeria monocytogenes 10403S 
were generated that express the secreted fusion protein ActAN100-PrgJ (Listeria 
AflaA-pPrgJ) or ActAN100-FlaA (Listeria AflaA-pFlaA), under the control of an 
IPTG-inducible promoter, as previously described. Macrophages were spinfected
in antibiotic free media at 400g for 10 min at an MOI = 5, with or without IPTG 
(1 mM), and placed at 37 °C for 30 min. The media was then replaced with complete
media containing gentamycin (10 µg ml⁻¹) and IPTG, and supernatants
were assayed for LDH release 5.5 h later.

Salmonella infections. Salmonella enterica serovar Typhimurium strain LT2 or 
isogenic flagellin mutant (FliC− FljB−) was grown in 10 ml LB standing cultures at
37°C overnight. The next morning, the cultures were diluted 1:100 in LB and 
grown for 4h (standing culture, 37 °C). Bacteria were added to macrophages at an 
MOI of 10-30, followed by centrifugation at 400g for 10 min. Gentamycin (25 µg
ml⁻¹) was added after 1 h to kill remaining extracellular bacteria. Caspase 1
processing or LDH release was monitored as previously described.
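
The infections above are specified by multiplicity of infection (MOI), that is, bacteria added per host cell. The arithmetic can be sketched as follows; the cell number per well and the culture density are illustrative assumptions, not values taken from the paper, and the function names are ours.

```python
def bacteria_needed(cells_per_well, moi):
    """Number of bacteria to add to one well for a given multiplicity of infection."""
    return cells_per_well * moi


def inoculum_volume_ul(cells_per_well, moi, culture_cfu_per_ml):
    """Volume of culture (in microlitres) that contains the required bacteria."""
    return bacteria_needed(cells_per_well, moi) / culture_cfu_per_ml * 1000.0


# Assumed values for illustration: 100,000 macrophages per well and a culture
# at 1e9 colony-forming units per ml.
cells = 100_000
density = 1e9
for label, moi in [("Listeria, MOI 5", 5), ("Salmonella, MOI 10", 10), ("Salmonella, MOI 30", 30)]:
    print(label, bacteria_needed(cells, moi), "bacteria,",
          round(inoculum_volume_ul(cells, moi, density), 2), "ul")
```

Sub-microlitre volumes like these are impractical to pipette, so in practice the culture would be diluted first and a larger volume added.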


26. Krantz, B. A. et al. A phenylalanine clamp catalyzes protein translocation through 
the anthrax toxin pore. Science 309, 777-781 (2005). 




doi:10.1038/nature10510 


The NLRC4 inflammasome receptors for bacterial 
flagellin and type III secretion apparatus 


Yue Zhao, Jieling Yang, Jianjin Shi, Yi-Nan Gong, Qiuhe Lu, Hao Xu, Liping Liu & Feng Shao


Inflammasomes are large cytoplasmic complexes that sense micro- 
bial infections/danger molecules and induce caspase-1 activation- 
dependent cytokine production and macrophage inflammatory 
death. The inflammasome assembled by the NOD-like receptor
(NLR) protein NLRC4 responds to bacterial flagellin and a conserved
type III secretion system (TTSS) rod component. How the
NLRC4 inflammasome detects the two bacterial products and the 
molecular mechanism of NLRC4 inflammasome activation are not 
understood. Here we show that NAIP5, a BIR-domain NLR protein 
required for Legionella pneumophila replication in mouse macrophages,
is a universal component of the flagellin-NLRC4 pathway.
NAIP5 directly and specifically interacted with flagellin, which 
determined the inflammasome-stimulation activities of different 
bacterial flagellins. NAIP5 engagement by flagellin promoted a 
physical NAIP5-NLRC4 association, rendering full reconstitution 
of a flagellin-responsive NLRC4 inflammasome in non-macrophage
cells. The related NAIP2 functioned analogously to NAIP5, serving 
as a specific inflammasome receptor for TTSS rod proteins such as 
Salmonella PrgJ and Burkholderia BsaK. Genetic analysis of 
Chromobacterium violaceum infection revealed that the TTSS 
needle protein CprI can stimulate NLRC4 inflammasome activation 
in human macrophages. Similarly, CprI is specifically recognized by
human NAIP, the sole NAIP family member in humans. The finding
that NAIP proteins are inflammasome receptors for bacterial 
flagellin and TTSS apparatus components further predicts that 
the remaining NAIP family members may recognize other uniden- 
tified microbial products to activate NLRC4 inflammasome- 
mediated innate immunity. 

The NLR protein NLRC4 (also known as IPAF) in macrophages 
activates caspase-1 and downstream inflammatory response upon 
sensing cytosolic presence of flagellin during bacterial infection.
To study the mechanism of the NLRC4 inflammasome, a defined 
biochemical assay was developed by fusing recombinant flagellin 
carboxy-terminal to the amino-terminal domain of anthrax lethal 
factor. This domain (designated as LFn here), through binding to 
another anthrax protein called protective antigen (PA), can efficiently 
translocate heterologous fusion proteins into mammalian cytosol 
through endocytosis-mediated entry. Using this system, purified
flagellin from L. pneumophila (LFn-FlaA) was found to trigger
robust caspase-1 cleavage (Fig. 1a), IL-1 release and pyroptotic death
(Supplementary Fig. 1a, b) in primary bone-marrow-derived macrophages
(BMMs). These activations were completely diminished in
Nlrc4−/− and caspase-1−/− macrophages (Fig. 1a and Supplementary
Fig. 1a, b); Asc−/− (Asc also known as Pycard) BMMs also showed
little caspase-1 maturation and IL-1 release but with a partially affected
pyroptosis due to ASC-independent NLRC4 inflammasome activation.
Full activation of NLRC4 inflammasome requires L470, L472
and L473 in Legionella flagellin. Accordingly, alanine substitutions of
the three leucine residues (3A) generated a largely inactive LFn-FlaA
protein (Fig. 1a and Supplementary Fig. 1a, b). LFn-FlaA induced
similar NLRC4-dependent caspase-1 activation and pyroptosis in 


Tlr4-‘~ macrophages (Supplementary Figs 2a, b and 3a), excluding a 
possible contribution from residual endotoxin contaminants present in 
the recombinant protein. Other bacteria such as Salmonella typhimurium 
also trigger flagellin-dependent NLRC4 inflammasome activation***. 
Delivery of S. typhimurium (LFn-FliC*) or Yersinia enterocolitica 
(LEn-FliC2*°) flagellin into BMMs induced robust caspase-1 activa- 
tion and extensive pyroptosis in an NLRC4-dependent manner (Sup- 
plementary Fig. 4a, b). Thus, LFn-mediated delivery of recombinant 
flagellin recapitulates all known genetic properties of flagellin activa- 
tion of the NLRC4 inflammasome. 

For L. pneumophila infection, flagellin-induced caspase-1 activation
requires NAIP5 (also known as BIRC1E), a BIR-domain-containing NLR
protein. A natural variant of NAIP5 renders macrophages from the A/J
mouse permissive to L. pneumophila intracellular replication. The role
of NAIP5 for other bacterial flagellins is not clear. RNA interference
(RNAi) knockdown of Naip5 (Supplementary Fig. 3b) severely blocked
LFn-FlaA^Lp-triggered caspase-1 activation and pyroptosis
(Supplementary Fig. 2a, b). Notably, activation of the NLRC4
inflammasome by LFn-FliC^St and LFn-FliC2^Ye, but not that by
Salmonella TTSS rod protein (LFn-PrgJ), was also drastically reduced by
short hairpin RNA (shRNA)-mediated stable knockdown of Naip5 (Fig. 1b, c
and Supplementary Fig. 3c). Consistently, flagellin-triggered caspase-1
activation during Salmonella and Legionella infection was significantly
attenuated in Naip5 knockdown macrophages (Fig. 1d).

The finding that NAIP5 is a possible integral component of the
flagellin-NLRC4 pathway inspired us to investigate whether NAIP5
directly recognizes flagellin. Legionella flagellin was found to show an
evident yeast two-hybrid interaction with NAIP5, but not NLRC4, whereas
the 3A mutant showed no interaction (Fig. 2a). Naip5 is located within a
genomic locus containing seven highly homologous Naip genes (Naip1-7)
and four of them (Naip1, Naip2, Naip5 and Naip6) have transcripts in
C57BL/6 mice. Legionella flagellin also showed a two-hybrid interaction
with NAIP6, but not with NAIP1 and NAIP2 (Fig. 2a). Supporting the
two-hybrid results, Legionella flagellin expressed in 293T cells readily
co-precipitated NAIP5 and NAIP6, but not NAIP1, NAIP2 and NLRC4, whereas
the 3A mutant failed to do so (Fig. 2b). The TLR5-binding-deficient
mutant (I391A), which is fully functional in inflammasome activation,
behaved similarly to wild-type flagellin in the co-immunoprecipitation
assay. Flagellin also co-precipitated NAIP5 encoded by the A/J allele
(NAIP5^A/J) (Fig. 2b), which explains the normal or nearly normal
caspase-1 activation in L. pneumophila-infected A/J macrophages.

A panel of nine additional flagellins from different bacteria was 
further profiled (Fig. 2c). In the two-hybrid assay, flagellins from 
S. typhimurium, Y. enterocolitica, Photorhabdus luminescens and 
Pseudomonas aeruginosa showed a positive result whereas those from 
enteropathogenic Escherichia coli (EPEC), enterohaemorrhagic E. coli 
(EHEC), Shigella flexneri, Chromobacterium violaceum and Burkholderia
thailandensis did not interact with NAIP5 (Fig. 2c and Supplementary
Fig. 5a). NAIP5 interaction with S. typhimurium flagellin

Figure 1 | A defined biochemical assay reveals a universal role of NAIP5 in
flagellin-triggered NLRC4 inflammasome activation in mouse
macrophages. a, Effects of anthrax lethal factor N-terminal-domain-mediated
intracellular delivery of Legionella flagellin (LFn-FlaA^Lp) on caspase-1
activation in lipopolysaccharide (LPS)-primed BMMs derived from wild-type
(WT, C57BL/6) or indicated knockout mice. 3A denotes a triple mutant
flagellin (L470A/L472A/L473A). Shown are anti-caspase-1 and anti-actin
immunoblots of culture supernatants (top) and total cell lysates (bottom). p10
denotes the processed mature form of caspase-1. b, c, Effects of Naip5
knockdown on flagellin-induced caspase-1 activation (b) and cell death (c). A
Naip5-targeting (N5) (Supplementary Table 1) or a control (C) shRNA was
stably expressed in immortalized BMMs. LFn-FlaA^Lp, FliC^St and FliC2^Ye are
recombinant LFn-tagged flagellins from L. pneumophila, S. typhimurium and
Y. enterocolitica, respectively. LFn-PrgJ is LFn-tagged S. typhimurium TTSS
rod protein. c, LDH releases are shown as mean values ± standard deviation
(s.d.) from three independent determinations. d, Effects of Naip5 knockdown
on flagellin-induced caspase-1 activation during Salmonella and Legionella
infection. ΔfliCΔfljB and ΔflaA denote flagellin-deficient strains of
S. typhimurium and L. pneumophila, respectively.

Figure 2 | Flagellin interacts specifically with NAIP5 and the interaction
correlates with the activity of flagellins from different bacteria. a, Yeast
two-hybrid interaction assays of Legionella flagellin (FlaA^Lp) and different
NAIP proteins (or mouse (m)NLRC4). The chart in the lower right corner
summarizes the interaction results. The known interaction between Legionella
effector LubX and its secretion chaperone IcmW was included as a positive
control. b, Co-immunoprecipitation assays of Legionella flagellin (FlaA^Lp) and
different NAIP proteins (or NLRC4). Shown are immunoblots of anti-Flag
immunoprecipitates (IP: Flag) and total cell lysates (Input). I391A is a
TLR5-binding-deficient flagellin mutant. c, Summary of yeast two-hybrid
interaction of NAIP5 with different bacterial flagellins. The raw data are
shown in Supplementary Fig. 5a. d, Caspase-1 activation assays of LFn-mediated
delivery of different bacterial flagellins into primary BMMs. Number
denotations of different flagellins follow those in c.

required three leucine residues equivalent to those in Legionella
flagellin. When delivered into BMMs, flagellins from S. typhimurium,
Y. enterocolitica, P. luminescens and P. aeruginosa, but not those from
the other five bacterial species, stimulated caspase-1 activation,
macrophage death and IL-1β release (Fig. 2d and Supplementary Fig. 5b, c).
The positive NAIP5-binding and inflammasome-stimulating activities
of S. typhimurium and P. aeruginosa flagellins agree with their genetic
requirements for infection-induced caspase-1 activation.
Among the inactive ones, S. flexneri flagellin is not expressed and is
dispensable for host innate immune detection of S. flexneri infection.
Genetic ablations of flagellins from EPEC and B. thailandensis also did
not affect infection-induced caspase-1 activation (Supplementary Fig. 6).
Thus, the ability of the ten different flagellins to interact with NAIP5
correlates well with their differential inflammasome-stimulating activity,
which further supports the idea that flagellin is generally recognized by
NAIP5 in triggering NLRC4 inflammasome activation.
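
The correlation described above can be tabulated and checked with a short sketch. The binding and activity calls below are transcribed from the text (yeast two-hybrid interaction with NAIP5; caspase-1 activation after delivery into BMMs); the dictionary layout and the function name `concordance` are illustrative choices, not part of the study.

```python
# Each entry: (binds NAIP5 in two-hybrid assay, activates caspase-1 in BMMs),
# as reported in the text for the ten flagellins profiled.
FLAGELLINS = {
    "L. pneumophila": (True, True),
    "S. typhimurium": (True, True),
    "Y. enterocolitica": (True, True),
    "P. luminescens": (True, True),
    "P. aeruginosa": (True, True),
    "EPEC": (False, False),
    "EHEC": (False, False),
    "S. flexneri": (False, False),
    "C. violaceum": (False, False),
    "B. thailandensis": (False, False),
}


def concordance(table):
    """Fraction of flagellins whose NAIP5 binding matches their activity."""
    agree = sum(1 for binds, activates in table.values() if binds == activates)
    return agree / len(table)


if __name__ == "__main__":
    print(f"{len(FLAGELLINS)} flagellins, concordance = {concordance(FLAGELLINS):.2f}")
```

A concordance of 1.0 here only restates the qualitative calls in the text; it says nothing about binding strength or the magnitude of activation.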

NAIP5 and NLRC4 were then co-expressed in 293T cells and their
possible interactions were investigated. Co-immunoprecipitation of
NAIP5 and NLRC4 was barely detectable in the absence of flagellin.
However, co-expression of Legionella flagellin, but not the 3A mutant,
significantly increased the amount of NAIP5 precipitated by NLRC4
(Fig. 3a, b). Flagellin was also detected in the NLRC4
immunoprecipitates due to the bridging effect of NAIP5. Deletion of the
nucleotide-binding P-loop in the NLRC4 nucleotide-binding and
oligomerization domain (NOD) abolished the flagellin-stimulated
NLRC4-NAIP5 interaction (Fig. 3a), which agrees with the reported
interaction between NOD domains from NLRC4 and NAIP5 (ref. 24).
Flagellin also promoted the association of NLRC4 with NAIP5^A/J, but
neither NAIP1 nor NAIP2 was precipitated by NLRC4 despite the presence
of flagellin (Fig. 3b). To test whether the flagellin-stimulated
NAIP5-NLRC4 complex can activate downstream signalling, NAIP5 and NLRC4,
together with pro-caspase-1 and pro-IL-1β, were co-expressed in
293T cells. Delivery of LFn-FlaA^Lp, but not the 3A mutant, into the
transfected cells resulted in an evident production of mature IL-1β
(Fig. 3c and Supplementary Fig. 7a). Omission of NAIP5, NLRC4 or
caspase-1 in this reconstitution abolished the response to flagellin

Figure 3 | Flagellin stimulates the NAIP5-NLRC4 association and
reconstitution of flagellin activation of the NLRC4 inflammasome in
non-macrophage cells. a, b, Co-immunoprecipitation assays of NAIP5 and NLRC4
interaction in the presence or absence of flagellin. Δploop in a denotes an
NLRC4 mutant with deletion of the nucleotide-binding P-loop. c, Reconstitution
of flagellin activation of the NLRC4 inflammasome in non-macrophage cells.
Lysates from 293T cells transfected with indicated plasmid combinations and
stimulated with LFn-FlaA^Lp were analysed for mature IL-1β (p17) by
immunoblotting. Expression of transfected inflammasome components for c and
d is in Supplementary Fig. 7. d, Assay of different NAIP proteins in supporting
reconstitution of flagellin activation of the NLRC4 inflammasome in 293T cells.

stimulation. NAIP5^A/J also supported the reconstitution whereas
NAIP1 and NAIP2 failed to do so (Fig. 3d and Supplementary Fig. 
7b), consistent with their differential association with NLRC4 upon 
flagellin stimulation (Fig. 3b). Moreover, the reconstituted NLRC4 
inflammasome exhibited robust responses to flagellins from 
Salmonella and Yersinia (Supplementary Fig. 8a). Each of the three 
domains in both NLR proteins (CARD, NOD and LRR in NLRC4; 
BIR, NOD and LRR in NAIP5) was essential for assembling a flagellin- 
responsive inflammasome complex (Supplementary Fig. 8b). These 
results indicate that flagellin recognition by NAIP5 stimulates the 
physical association between NAIP5 and NLRC4, thereby signalling 
downstream caspase-1 activation. 

NAIP6 interacted with flagellin in a manner similar to NAIP5 
(Fig. 2b and Supplementary Fig. 9) and supported the reconstitution 
in 293T cells (Fig. 3b, d). In fact, among all the NAIP proteins, NAIP6
shares the highest sequence identity (94.7%) with NAIP5 (Supplementary
Fig. 10). NAIP6 probably has a similar function to NAIP5 in
macrophage detection of flagellin, but its role might be relatively minor 
given its much lower expression in primary macrophages compared 
with that of NAIP5 (ref. 12). 

The NLRC4 inflammasome also responds to a conserved TTSS rod 
protein such as PrgJ in S. typhimurium, BsaK in B. thailandensis and 
EscI in EPEC. Delivery of recombinant BsaK (LFn-BsaK) into BMMs
recapitulated such effects and induced NLRC4-dependent caspase-1 
activation and pyroptosis (Supplementary Fig. 11). Given that NAIP5 
recognizes flagellin and that PrgJ activation of the NLRC4 inflamma- 
some is independent of NAIP5 (ref. 17), we proposed that other NAIP 
proteins could recognize the TTSS rod protein. BsaK was found to 
interact with NAIP2, but not NAIP1, NAIP5, NAIP6 and NLRC4, in 
the two-hybrid assay (Fig. 4a). Co-immunoprecipitation assay con- 
firmed this NAIP2-specific interaction (Fig. 4b). This observation 
agrees with the idea that NAIP2 is the most distantly related to the 
other NAIP proteins (Supplementary Fig. 10). Reconstitution in non- 
macrophage cells further showed that only NAIP2, but not any other 
NAIP, effected robust IL-1β maturation upon LFn-BsaK stimulation
(Fig. 4c). These findings indicate that NAIP2 is the specific receptor for 
the TTSS rod protein. 

To test the requirement of NAIP2 for detecting the TTSS rod protein 
in macrophages, Naip2 stable knockdown BMMs were generated. 
Among the four different shRNAs (Naip2-1, 2, 3 and 4), Naip2-1 and 
Naip2-2 considerably reduced Naip2 messenger RNA level whereas 
Naip2-3 and Naip2-4 showed intermediate and negligible efficiency, 
respectively (Supplementary Fig. 12a). Naip2-1 and Naip2-2 knockdown 
macrophages exhibited significant resistance in caspase-1 activation and 
pyroptosis to LFn-PrgJ or LFn-BsaK stimulation (Supplementary Fig. 
12b-d). In contrast, Naip2-3 knockdown macrophages showed a mild 
resistance and Naip2-4 knockdown macrophages had a normal sensi- 
tivity to rod protein stimulations. In Naip2-2 knockdown macrophages, 
in which mRNA levels of other Naip genes were not affected (Sup- 
plementary Fig. 13), attenuated caspase-1 activation was only observed 
with the rod protein stimulations, but not with flagellin stimulations 
(Fig. 4d). Furthermore, deletion of genes encoding the rod proteins from 
flagellin-deficient EPEC and S. typhimurium abolished bacterial 
infection-induced caspase-1 activation, and this effect did not occur in 
Naip2-2 knockdown macrophages (Fig. 4e). These results demonstrate 
the critical and specific role of NAIP2 in recognizing the TTSS rod 
protein for NLRC4 inflammasome activation. 

In contrast to mouse macrophages, human U937 monocyte-derived 
macrophages were unresponsive to intracellular delivery of flagellin and 
BsaK/PrgJ-like rod protein (Supplementary Fig. 14). When profiling 
our genetic collection of various pathogenic bacteria, a C. violaceum 
strain (deficient in secretion of TTSS effectors) was identified to be 
capable of inducing caspase-1 activation in human U937 monocytes 
(Supplementary Fig. 15a). Notably, further ablation of five possible 
flagellin genes (ΔF) caused no reduction in this activation. Stable
knockdown of NLRC4 (Supplementary Fig. 16) significantly attenuated C.
Figure 4 | NAIP2 interacts with the TTSS rod protein and is required for the
rod protein to trigger mouse NLRC4 inflammasome activation. a, b, Yeast
two-hybrid (a) and co-immunoprecipitation (b) assays of interactions between
B. thailandensis rod protein BsaK and different NAIP proteins.
c, Reconstitution of BsaK activation of the NLRC4 inflammasome in
non-macrophage cells. Lysates from HeLa cells transfected with indicated
plasmid combinations and stimulated with LFn-BsaK were analysed for mature
IL-1β (p17) by immunoblotting. Expression of transfected inflammasome
components is in Supplementary Fig. 7. d, Effects of Naip2 knockdown on




violaceum ΔF-triggered caspase-1 activation and pyroptosis (Fig. 5a
and Supplementary Fig. 17a). The PrgJ homologue in the C. violaceum
Cpi-1 TTSS system, CprJ, is encoded in a separate Cpi-1a locus that
harbours several additional TTSS apparatus genes (Fig. 5b). Although
caspase-1 activation induced by TTSS rod proteins and flagellins. Control (C)
or Naip2-2 (N2-2) stable knockdown macrophages (Supplementary Fig. 12)
were stimulated with purified LFn-tagged BsaK, PrgJ, FlaA^Lp or FliC^St
proteins as indicated. e, Effects of Naip2 knockdown on rod-protein-induced
caspase-1 activation during EPEC and Salmonella infection. EPEC E2348/69
ΔfliCΔescI and S. typhimurium ΔfliCΔfljBΔprgJ denote the rod-protein-deficient
EPEC and S. typhimurium strains, respectively, which were constructed on the
flagellin-deletion background.


cprJ was not required for infection-induced caspase-1 activation and
pyroptosis, deletion of the entire Cpi-1a locus largely diminished
C. violaceum-induced NLRC4 inflammasome activation (Fig. 5a,
Supplementary Fig. 15b and Supplementary Fig. 17). Further genetic
analysis of the entire Cpi-1a locus identified cprI, which was essential
for inducing caspase-1 activation and pyroptosis (Fig. 5c and
Supplementary Fig. 17b). A CprI-expressing plasmid could rescue the
deficiencies of inflammasome activation for both cprI and Cpi-1a deletion
strains (Fig. 5d). Thus, C. violaceum requires cprI to stimulate NLRC4
inflammasome activation in human macrophages.

cprI encodes the conserved TTSS needle subunit that is a sequence
paralogue of the rod protein, raising a hypothesis that the needle
protein is the bacterial ligand recognized by the human NLRC4
inflammasome. Consistent with the above genetic analyses, LFn-mediated
delivery of CprI, but not other Cpi-1a-encoded proteins,
triggered robust caspase-1 activation and pyroptosis in U937 cells


Figure 5 | C. violaceum infection studies reveal that the human NLRC4
inflammasome responds to the TTSS needle subunit through specific
recognition by human NAIP. a-c, Caspase-1 activation assays of C. violaceum
infections of human U937 monocyte-derived macrophages. ΔF has deletions of
five possible flagellin genes in C. violaceum. Control (C) or NLRC4 (+) stable
knockdown cells were used in a. ΔFΔCpi-1a means a deletion of the entire
TTSS Cpi-1a locus illustrated in the schematic drawing shown in b. Detailed
information for all the mutant strains is listed in Supplementary Table 3.
d, Complementation of Cpi-1a locus or cprI deletion C. violaceum strains by a
CprI-expressing plasmid. PMA-differentiated U937 cells were infected with
indicated C. violaceum mutant or rescue strain. 2A is a double mutant of CprI
(V69A/I79A). e, Caspase-1 activation assays of delivery of CprI into human
U937 macrophages and effects of NLRC4 and ASC knockdown. Control (C) or
NLRC4 or ASC stable knockdown cells were stimulated with LFn-CprI or other
indicated LFn fusion proteins. f, Reconstitution of CprI activation of the human
NLRC4 inflammasome in 293T cells. h, human. g, Co-immunoprecipitation
assay of CprI and different NAIP proteins (or NLRC4).

(Supplementary Fig. 18a, b), which were largely decreased in NLRC4
and ASC knockdown cells (Fig. 5e and Supplementary Fig. 18c).
Mutation of two hydrophobic residues (V69A/I79A, 2A) in a helical
hairpin region in CprI diminished its activity of stimulating
inflammasome activation (Fig. 5d, e and Supplementary Fig. 18). CprI
activation of the NLRC4 inflammasome could also be robustly reconstituted
in 293T cells and the 2A mutant remained inactive in this assay
(Fig. 5f). Most importantly, this reconstitution required human
NAIP, the sole NAIP family member in human. Human NAIP-based
reconstitution specifically responded to LFn-CprI, but not to
LFn-FlaA^Lp and LFn-BsaK; LFn-CprI did not activate NAIP5- and
NAIP2-based reconstitution (Supplementary Fig. 19). Furthermore,
CprI readily co-precipitated human NAIP, but not any of NAIP2,
NAIP5 and NLRC4 from 293T cells, and the nonfunctional 2A mutant
failed to interact with human NAIP (Fig. 5g). Homologous needle
subunits from EHEC, B. thailandensis, P. aeruginosa, S. flexneri and
S. typhimurium, but not those from EPEC and V. parahaemolyticus,
also stimulated NLRC4 inflammasome activation in U937 cells
(Supplementary Fig. 20). Thus, human NAIP functions analogously to
mouse NAIP5/2, but specifically recognizes the TTSS needle subunit
to trigger human NLRC4 inflammasome activation.

In summary, murine NLR proteins NAIP5 and NAIP2 directly 
recognize bacterial flagellin and TTSS rod protein, respectively, whereas 
human NAIP serves as a specific receptor for the TTSS needle protein. 
Engagement of NAIP receptors by corresponding bacterial ligands 
promotes their physical association with NLRC4, resulting in activation 
of the NLRC4 inflammasome and macrophage innate immunity. The 
inflammasome-stimulating activities of flagellin, TTSS rod and needle 
proteins lie in their C-terminal leucine-rich helical hairpin regions that 
share structural commonalities. Thus, other homologous NAIP
proteins might recognize additional bacterial products of similar bio- 
chemical features for counteracting diverse bacterial infections. Our 
results also indicate that NLRC4 acts as an adaptor through which 
inflammasome activation signals generated from different NAIP recep- 
tors are transduced to caspase-1. Involvement of an additional cytosolic 
pattern recognition receptor (PRR) protein for sensing one microbial 
product has previously been noted with NLRP3- and NALP1-mediated
inflammasome activation. Future studies will probably identify
more PRR proteins that act sequentially within a single inflammasome 
complex in response to microbial products or danger signals. 
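
The receptor-adaptor logic summarized above can be written as a toy lookup, offered only as a sketch. The receptor-ligand pairs come from the text (NAIP6 is included because it behaved like NAIP5 in the binding and reconstitution assays); the function name `inflammasome_active` and the boolean treatment of NLRC4 and caspase-1 are simplifying assumptions, not a quantitative model.

```python
# Receptor -> bacterial ligand class, as reported in the text.
RECEPTOR_LIGAND = {
    "mouse NAIP5": "flagellin",
    "mouse NAIP6": "flagellin",
    "mouse NAIP2": "TTSS rod protein",
    "human NAIP": "TTSS needle protein",
}


def inflammasome_active(receptor, ligand, nlrc4=True, caspase1=True):
    """Return True only if the ligand matches the receptor and both the
    NLRC4 adaptor and caspase-1 are present (a simplifying assumption
    mirroring the omission experiments in the reconstitution assay)."""
    matched = RECEPTOR_LIGAND.get(receptor) == ligand
    return bool(matched and nlrc4 and caspase1)


if __name__ == "__main__":
    print(inflammasome_active("mouse NAIP5", "flagellin"))
    print(inflammasome_active("mouse NAIP2", "flagellin"))
    print(inflammasome_active("human NAIP", "TTSS needle protein", nlrc4=False))
```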


METHODS SUMMARY 

LFn-mediated intracellular delivery and RNAi. For delivery into macrophages,
purified recombinant proteins were washed with 60% isopropanol to remove the
majority of endotoxin contaminants. LFn-flagellin, LFn-BsaK/PrgJ/CprJ, LFn-CprI
or other indicated control proteins together with PA proteins were added into
culture medium (serum-free) at a final concentration of 1 μg ml−1 for each
protein. Cells were further incubated for 1 h (primary BMMs) or 3 h
(immortalized BMMs) before being subjected to the indicated inflammasome
activation assays. Transient small interfering RNA (siRNA) knockdown in
macrophages was performed using the INTERFERin reagent (Polyplus Transfection)
by following the manufacturer's instruction. To achieve stable knockdown in
macrophages, a modified pLKO.1-GFP plasmid harbouring a specific shRNA
(Supplementary Table 1) was transduced into BMMs or U937 cells by lentiviral
infection and GFP-positive knockdown cells were sorted out by flow cytometry
for further functional analysis.


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Plasmids, antibodies and reagents. DNAs for flagellin were amplified from the
corresponding bacterial genomic DNA, and cloned into pET28a-LFn vector
(Addgene) for recombinant expression in E. coli as described previously. BsaK
and PrgJ DNAs were amplified from B. thailandensis E264 and S. typhimurium LT2
strains, respectively, and inserted into the same pET28a-LFn vector. DNAs for
CprI, CprJ, CorB and CorC were amplified from C. violaceum strain (ATCC
accession 12472) and also cloned into pET28a-LFn vector to prepare recombinant
LFn fusion protein. PA expression plasmid was also obtained from Addgene. To
construct the complementation plasmid for the C. violaceum mutant, CprI or CprJ
DNAs with ribosome binding site (RBS) sequence were cloned into the pBBR1MCS2
vector. Expression plasmids for pro-caspase-1 and pro-IL-1β were provided by
X. Wang (University of Texas Southwestern Medical Center). cDNAs for mouse
NAIP1, NAIP2, NAIP5^C57BL/6, NAIP6, human NAIP and NLRC4 were amplified from
IMAGE EST clones (40130690, 40086453, 6850660, 100068362, 9052275 and
5179909, respectively) and mouse NLRC4 was amplified from reverse-transcribed
mouse cDNA. For mammalian expression, cDNAs for all NLR proteins were cloned
into modified pCS2 vectors with an N-terminal Myc, HA or Flag epitope tag. All
truncations and point mutations were generated by standard molecular biology
procedures. All plasmids were verified by DNA sequencing.

Antibodies for caspase-1 and Myc epitopes were obtained from Santa Cruz
Biotechnology. Other antibodies used in this study include IL-1β (3ZD;
Biological Resources Branch, National Cancer Institute), HA epitope (Covance)
and Flag M2 (Sigma). 293T and HeLa cells obtained from ATCC were grown in
Dulbecco's modified Eagle's medium containing 10% fetal bovine serum and
2 mM L-glutamine at 37 °C in a 5% CO2 incubator. Cell culture products were
from Invitrogen and all other chemicals were Sigma-Aldrich products unless
noted.

Mouse BMMs and human monocyte-derived macrophages. C57BL/6 wild-type
mice were from Vital River Laboratory Animal Technology Co. and caspase-1−/−
mice were obtained from the Jackson Laboratory. Nlrc4−/− and Asc−/− mice
were provided by V. Dixit (Genentech). All knockout alleles have been crossed
onto the C57BL/6 background. All animal experiments were conducted following
the Ministry of Health national guidelines for housing and care of laboratory
animals and performed in accordance with institutional regulations after review
and approval by the Institutional Animal Care and Use Committee at National
Institute of Biological Sciences. Primary BMMs were prepared by following a
standard procedure as previously described. An immortalized macrophage line
derived from C57BL/6 mice was provided by K. A. Fitzgerald (University of
Massachusetts Medical School) and TLR4-deficient immortalized BMMs were a
gift from A. Ding (Cornell University). Human U937 monocytes obtained from
ATCC were cultured in RPMI-1640 containing 10% FBS and 2 mM L-glutamine
and grown at 37 °C with 5% CO2. PMA (50 ng ml−1) was used to induce U937
differentiation for 48 h. Differentiated U937 cells were digested with 2 mM EDTA
in PBS and subcultured in 24-well plates for further experiments.

Yeast two-hybrid and co-immunoprecipitation assays. Indicated flagellin, bsaK
and prgJ genes were cloned into the bait vector pLexAde, and mouse Naip1, Naip2,
Naip5, Naip6 and Nlrc4 cDNAs were cloned into the prey vector pVP16.
The bait and prey plasmids were co-transformed into the reporter Saccharomyces
cerevisiae strain L40 by using the lithium acetate method. Two-hybrid assays were
performed by following a classical procedure.

For immunoprecipitation, 293T cells were transfected with indicated plasmids. 
Cells were harvested and lysed in a buffer containing 50 mM Tris-HCl (pH 7.6), 
150mM NaCl and 1% Triton X-100 supplemented with a protease inhibitor 
mixture (Roche Molecular Biochemicals). Precleared lysates were subjected to 
anti-Flag M2 immunoprecipitation by following the manufacturer’s instructions. 
The beads were washed three times with the lysis buffer and the immunoprecipi- 
tates were eluted in the SDS sample buffer followed by immunoblotting analysis. 
All the immunoprecipitation assays were performed more than three times and 
representative results are shown in the figures. 

Purification of recombinant proteins. E. coli BL21 (DE3) strains harbouring the
expression plasmids were grown in Luria-Bertani medium (tryptone, 10 g l−1;
yeast extract, 5 g l−1; NaCl, 10.0 g l−1) supplemented with appropriate
antibiotics. Protein expression was induced overnight at 22 °C with 0.4 mM
isopropyl-β-D-thiogalactopyranoside (IPTG) after OD600 nm reached 0.8. Bacteria
were harvested and lysed in a buffer containing 50 mM Tris-HCl (pH 7.6),
300 mM NaCl and 25 mM imidazole. His-tagged proteins were purified by affinity
chromatography using Ni-NTA beads (Qiagen). To remove the majority of endotoxin
contaminants, proteins bound onto the Ni-NTA column were subjected to an
additional wash with 60% isopropanol in the wash buffer (>30× column volume).
Proteins were then eluted with 250 mM imidazole in 50 mM Tris-HCl (pH 7.6) and
300 mM NaCl. Eluted samples were further dialysed against a buffer containing
50 mM Tris-HCl (pH 7.6) and 150 mM NaCl to remove the imidazole. Protein
concentrations were estimated by Coomassie blue staining of SDS-PAGE gels
using BSA as the standard.

NLRC4 inflammasome reconstitution in HeLa and 293T cells. For reconstitution
in 293T or HeLa cells, cells were seeded into a 6-well plate 12 h before
transfection with indicated combinations of plasmids using the Vigofect reagents
(Vigorous). The amounts of plasmids used were 2 μg for pro-human IL-1β, 100 ng
(HeLa cells) or 50 ng (293T cells) for caspase-1, 100 ng for NLRC4 and 100 ng for
NAIP proteins. Twenty-four hours later, LFn-flagellin or LFn-BsaK/PrgJ/CprJ or
LFn-CprI together with PA proteins was added into the culture medium at the
final concentration of 10 μg ml−1 for HeLa cells (2 μg ml−1 for 293T cells) and
incubated for another 12 h. Cells were harvested and lysed in a buffer containing
50 mM Tris-HCl (pH 7.6), 150 mM NaCl and 1% Triton X-100. Lysates were
resolved on SDS-PAGE gels followed by anti-IL-1β immunoblotting analysis.
All the reconstitution experiments were performed more than three times and
representative results are shown in the figures.

RNAi knockdown. For siRNA knockdown, immortalized BMMs were cultured in
24-well plates at a density of 4 × 10^4 per well, and siRNA transfection was
performed using the INTERFERin reagent (Polyplus Transfection) by following the
manufacturer's instruction. 2 μl of 20 μM siRNA (final concentration, 100 nM)
and 2 μl of INTERFERin reagents were used for each well. Sixty hours after
transfection, knockdown efficiency and caspase-1 activation were monitored by
quantitative real-time PCR (qRT-PCR) and anti-caspase-1 immunoblotting
analysis, respectively.
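
As a consistency check on the siRNA amounts, the dilution relation C1·V1 = C2·V2 can be applied directly. This is a minimal sketch that reads the stock as 20 μM and the final concentration as 100 nM; the implied well volume is an inference from those two numbers, not a value stated in the Methods, and the function name is illustrative.

```python
def final_volume_ul(stock_um, stock_ul, final_nm):
    """Total volume (in microlitres) at which `stock_ul` of a `stock_um`
    micromolar stock gives a final concentration of `final_nm` nanomolar.
    Uses C1 * V1 = C2 * V2 with the stock converted from uM to nM."""
    if final_nm <= 0:
        raise ValueError("final concentration must be positive")
    return stock_um * 1000 * stock_ul / final_nm


if __name__ == "__main__":
    # 2 ul of 20 uM siRNA diluted to 100 nM
    print(final_volume_ul(stock_um=20, stock_ul=2, final_nm=100), "ul per well")
```

Under these assumptions the numbers imply roughly 400 μl of medium per well, which is a plausible working volume for a 24-well plate.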

To achieve stable knockdown in immortalized BMMs or U937 cells, shRNAs 
targeting NAIP5, NAIP2, human NLRC4 or human ASC (listed in Supplementary 
Table 1) were cloned into a modified lentiviral vector pLKO.1, in which puromycin 
resistance gene was replaced by GFP coding sequence. pLKO.1-GFP shRNA
plasmids were transfected together with two packaging plasmids (pCMV-dR8.2 dvpr and
pCMV-VSV-G, both from Addgene) into 293T cells. Lentivirus expressing shRNA 
was collected from the supernatant 48 h after transfection and was used to infect 
BMMs for another 48 h or U937 cells for 12 h. GFP-positive cells were sorted out by 
flow cytometry. The pool of sorted cells was either directly used in subsequent
functional assays or diluted into 96-well plates to obtain single clones. Knockdown 
efficiency was examined by qRT-PCR analysis or immunoblotting analysis (for 
ASC knockdown in U937 cells). 

Caspase-1-mediated inflammasome activation assays. To assay caspase-1 
activation, culture supernatants of macrophages treated with indicated stimuli 
were subjected to TCA precipitation and the precipitates were analysed by anti- 
caspase-1 immunoblotting to detect both pro-Casp1 and processed mature p10 
fragment; cell lysates were blotted with Casp1 and actin antibodies to show the 
level of pro-Casp1 in cell lysates and actin loading, respectively. All caspase-1 
activation assays in response to LFn-mediated protein delivery and bacterial infec- 
tion were repeated at least three times and the representative results are shown in 
the figures. Mature IL-1β released into the culture supernatants was measured by
using the IL-1β ELISA kit (Neobioscience Technology Company). Pyroptotic cell
death was measured by the lactate dehydrogenase (LDH) assay using CytoTox 96 
Non-Radioactive Cytotoxicity Assay kit (Promega). Cell viability was determined 
by the CellTiter-Glo Luminescent Cell Viability Assay (Promega). 
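
LDH release is conventionally converted to percent cytotoxicity by subtracting spontaneous release and scaling to a maximum-lysis control. The sketch below assumes that conventional formula; the Methods do not state the exact calculation, and the function names and example absorbance values are hypothetical. The mean and s.d. helper matches the way LDH data are reported in Fig. 1c (three independent determinations).

```python
from statistics import mean, stdev


def percent_cytotoxicity(experimental, spontaneous, maximum):
    """Percent LDH release relative to a maximum-lysis control,
    after subtracting spontaneous release (conventional formula,
    assumed here rather than taken from the paper)."""
    if maximum <= spontaneous:
        raise ValueError("maximum release must exceed spontaneous release")
    return 100.0 * (experimental - spontaneous) / (maximum - spontaneous)


def summarize(replicates):
    """Mean and sample standard deviation of replicate determinations."""
    return mean(replicates), stdev(replicates)


if __name__ == "__main__":
    # Hypothetical absorbance readings for three determinations.
    readings = [(0.80, 0.20, 1.40), (0.86, 0.20, 1.40), (0.74, 0.20, 1.40)]
    values = [percent_cytotoxicity(e, s, m) for e, s, m in readings]
    avg, sd = summarize(values)
    print(f"cytotoxicity = {avg:.1f} +/- {sd:.1f} %")
```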

qRT-PCR analysis. For qRT-PCR analysis, total RNA was extracted by TRIzol
(Invitrogen) and digested with DNase I (Invitrogen). One microgram of total RNA 
was reverse-transcribed into cDNA using M-MLV reverse transcriptase 
(Promega). qRT-PCR analysis was performed using the SYBR Premix Ex Taq 
(TaKaRa) on Applied Biosystems 7500 Fast Real-Time PCR System. Primers used 
for qRT-PCR analysis are listed in Supplementary Table 2. The mRNA level of 
targeted genes was normalized to that of Gapdh for mouse BMMs or to that of 
actin for U937 cells. 
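
The normalization described above can be sketched in a few lines. The text does not say which quantification model was used, so this sketch assumes the common 2^-ΔΔCt method with roughly 100% primer efficiency; the function names and the Ct values are hypothetical and serve only as illustration.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_control, ct_ref_control):
    """Fold change of a target mRNA by the 2^-ddCt method.

    Assumption (not stated in the text): amplification efficiency is
    close to 100%, so one cycle corresponds to a two-fold difference.
    The reference gene would be Gapdh (mouse BMMs) or actin (U937 cells).
    """
    d_ct_sample = ct_target_sample - ct_ref_sample      # normalize sample
    d_ct_control = ct_target_control - ct_ref_control   # normalize control
    dd_ct = d_ct_sample - d_ct_control
    return 2.0 ** (-dd_ct)


def knockdown_efficiency(fold_change):
    """Percentage reduction of the target mRNA relative to the control."""
    return (1.0 - fold_change) * 100.0


# Hypothetical Ct values, for illustration only.
fold = relative_expression(26.0, 18.0, 24.0, 18.0)
print(fold, knockdown_efficiency(fold))
```

With these made-up values the target is detected two cycles later in the knockdown sample than in the control after normalization, which corresponds to a quarter of the control level.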

Bacterial manipulation and macrophage infection. L. pneumophila strains were 
cultured on buffered charcoal yeast extract agar supplemented with 0.1 mg ml⁻¹ 
thymidine (BCYET). For infection, bacteria were scraped, diluted in sterile water 
and added to cells. EPEC strains (E2348/69) were grown overnight in 2× YT 
(tryptone, 16.0 g l⁻¹; yeast extract, 10.0 g l⁻¹; NaCl, 5.0 g l⁻¹) medium without 
shaking, and then diluted 1:40 in DMEM medium for 4 h to induce the expression 
of type III secretion system before infection. For S. typhimurium infection, overnight 
2× YT culture was diluted 1:100 and grown for 3 h to induce SPI-1 expression. 
B. thailandensis E264 was obtained from ATCC and cultured as described. 
Wild-type C. violaceum strain (ATCC 12472) was provided by N. Okada and 
cultured as previously described. To infect U937 cells, the indicated C. violaceum 
strain cultured overnight at 37 °C in LB broth under conditions of vigorous shaking 
was diluted 1:100 in fresh LB broth, and further grown for 3 h to obtain an 
optical density at A600 of 2.0 to 2.5. The bacteria were diluted in serum-free 
RPMI-1640 medium to achieve a multiplicity of infection (MOI) of 10. All infection 
experiments were performed with a centrifugation of 1,000g for 10 min at 22 °C. 
The flagellin-deficient L. pneumophila strain (Lp02ΔflaA) was generated by 
standard homologous recombination using the suicide plasmid pSR47s. Deletion 
of genes encoding the type III rod protein in EPEC (ΔescI) and S. typhimurium 
(ΔprgJ) strains was achieved by using the suicide vector pCVD442 as described 
previously. For gene deletion in B. thailandensis, a modified suicide vector 
pDM4-pheS expressing a mutant phenylalanine synthetase (PheS) for counter-selection 
was constructed first. Briefly, the sacB gene in the commonly used suicide 
vector pDM4 (provided by D. Milton and L. Gong) was replaced with a 1.1-kb PS12-pheS 
fragment (PS12, the promoter of the B. pseudomallei rpsL gene) amplified from 
pBBR1MCS-Km-pheS (provided by T. T. Hoang). A PCR fragment containing 
flanking sequences of the target gene was then cloned into pDM4-pheS. The resulting 
targeting vector was transferred into B. thailandensis through E. coli SM10 
(λpir)-mediated conjugational mating. The transconjugants were selected in 
LB agar medium containing chloramphenicol (50 μg ml⁻¹) and streptomycin 
(100 μg ml⁻¹). The integrants were further screened for markerless in-frame deletion 
by growth on M9 agar plates supplemented with 20 mM glucose and 0.1% 
p-chlorophenylalanine. All the mutants were verified by PCR and DNA sequencing. 
Both flagellin genes in B. thailandensis E264, fliC (open reading frame (ORF), 
BTH_I3196) and fliC2 (ORF, BTH_II0151), were deleted to obtain the flagellin-deficient 
strain. For gene deletion in C. violaceum, the original pDM4-SacB suicide 
vector was used. Briefly, a PCR fragment containing flanking sequence of the 
targeted gene was cloned into pDM4-SacB. The resulting targeting vector was 
transferred into C. violaceum through E. coli SM10 (λpir)-mediated conjugational 
mating. The transconjugants were selected in LB agar medium containing 
chloramphenicol (17 μg ml⁻¹) and nalidixic acid (25 μg ml⁻¹). The integrants were 
further screened for markerless in-frame deletion by growth on LB agar plates 
containing 16% sucrose without NaCl. Detailed information for all deletion strains 
is listed in Supplementary Table 3. All the mutants were verified by PCR and DNA 
sequencing. 

To examine the role of flagellin in stimulating caspase-1 activation during mouse 
macrophage infection, wild-type, type III secretion-deficient ΔescN (CVD452, 
provided by M. Donnenberg) and flagellin-deficient ΔfliC (AGT01, provided by 
J. A. Girón) strains of EPEC E2348/69 were used to infect immortalized BMMs at a 
MOI of 10 for 2 h. Wild-type, type III-deficient ΔbipB and flagellin-deficient 
ΔfliC/ΔfliC2 strains of B. thailandensis were used to infect J774 mouse macrophages at a 
MOI of 10 for 2 h. To assay the physiological function of NAIP5 in detecting 
bacterial flagellin, control or Naip5 stable knockdown immortalized BMMs were 
infected with S. typhimurium strain (wild type, ATCC 14028s or ΔfliCΔfljB mutant, 
fliC::Tn10 fljB5001::Mud-Cm; both strains were provided by E. A. Miao) for 
15 min, or L. pneumophila (Lp02 or Lp02ΔflaA) for 40 min at a MOI of 50. To 
assay the function of NAIP2 in detecting the type III rod protein during infection, 
control or Naip2 stable knockdown immortalized BMMs were infected with 
S. typhimurium (ΔfliCΔfljB or ΔfliCΔfljBΔprgJ) for 30 min or EPEC E2348/69 
strain (ΔfliC or ΔfliCΔescI) for 2 h at a MOI of 50. Supernatants and cell lysates 
of infected macrophages were collected and subjected to caspase-1 activation assays 
described above. 


30. Yao, Q. et al. A bacterial type III effector family uses the papain-like hydrolytic 
activity to arrest the host cell cycle. Proc. Natl Acad. Sci. USA 106, 3716-3721 
(2009). 
31. Cui, J. et al. Glutamine deamidation and dysfunction of ubiquitin/NEDD8 induced 
by a bacterial effector family. Science 329, 1215-1218 (2010). 
32. Li, P. et al. Mice deficient in IL-1β-converting enzyme are defective in production of 
mature IL-1β and resistant to endotoxic shock. Cell 80, 401-411 (1995). 
33. Mariathasan, S. et al. Differential activation of the inflammasome by caspase-1 
adaptors ASC and Ipaf. Nature 430, 213-218 (2004). 
34. Boyden, E. D. & Dietrich, W. F. Nalp1b controls mouse macrophage susceptibility 
to anthrax lethal toxin. Nature Genet. 38, 240-244 (2006). 
35. Vojtek, A. B. & Cooper, J. A. Rho family members: activators of MAP kinase 
cascades. Cell 82, 527-529 (1995). 
36. Dong, N., Liu, L. & Shao, F. A bacterial effector targets host DH-PH domain RhoGEFs 
and antagonizes macrophage phagocytosis. EMBO J. 29, 1363-1376 (2010). 
37. Barrett, A. R. et al. Genetic tools for allelic replacement in Burkholderia species. Appl. 
Environ. Microbiol. 74, 4498-4508 (2008). 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


doi:10.1038/nature10407 


SUMO1-dependent modulation of SERCA2a in heart failure 

& Roger J. Hajjar 


The calcium-transporting ATPase ATP2A2, also known as 
SERCA2a, is a critical ATPase responsible for Ca²⁺ re-uptake 
during excitation-contraction coupling. Impaired Ca²⁺ uptake 
resulting from decreased expression and reduced activity of 
SERCA2a is a hallmark of heart failure. Accordingly, restoration 
of SERCA2a expression by gene transfer has proved to be effective in 
improving cardiac function in heart-failure patients, as well as in 
animal models. The small ubiquitin-related modifier (SUMO) can 
be conjugated to lysine residues of target proteins, and is involved 
in many cellular processes. Here we show that SERCA2a is 
SUMOylated at lysines 480 and 585 and that this SUMOylation is 
essential for preserving SERCA2a ATPase activity and stability in 
mouse and human cells. The levels of SUMO1 and the SUMOylation 
of SERCA2a itself were greatly reduced in failing hearts. SUMO1 
restitution by adeno-associated-virus-mediated gene delivery maintained 
the protein abundance of SERCA2a and markedly improved 
cardiac function in mice with heart failure. This effect was comparable 
to SERCA2A gene delivery. Moreover, SUMO1 overexpression 
in isolated cardiomyocytes augmented contractility and accelerated 
Ca²⁺ decay. Transgene-mediated SUMO1 overexpression rescued 
cardiac dysfunction induced by pressure overload concomitantly 
with increased SERCA2a function. By contrast, downregula- 
tion of SUMO1 using small hairpin RNA (shRNA) accelerated 
pressure-overload-induced deterioration of cardiac function and 
was accompanied by decreased SERCA2a function. However, 
knockdown of SERCA2a resulted in severe contractile dysfunction 
both in vitro and in vivo, which was not rescued by overexpression 
of SUMO1. Taken together, our data show that SUMOylation is a 
critical post-translational modification that regulates SERCA2a 
function, and provide a platform for the design of novel therapeutic 
strategies for heart failure. 

It was previously reported that SERCA2a activity could be modulated 
by post-translational modifications (PTMs) such as glutathiolation 
and nitration. We identified SUMO1 as a SERCA2a interactor in 
a proteomic screen (Supplementary Fig. 1a, b). We confirmed that 
SERCA2a binds in vivo to SUMO1 and to ubiquitin-conjugating 
enzyme E2I (UBE2I, also known as UBC9) (Supplementary Fig. 1c-e). 

We then examined whether SERCA2a is indeed SUMOylated in 
human hearts. The SERCA2a band was present along with other slowly 
migrating SERCA2a bands (Fig. 1a, top panel), which represent the 
SUMOylated protein. The level of SUMOylated SERCA2a was significantly 
reduced in failing hearts compared to normal hearts. 
SUMOylation of SERCA2a was specific for SUMO1 (but not for 
SUMO2 or 3) (Supplementary Fig. 1f). Along with the reduction in 
SERCA2a, which is consistent with previous reports, the level of 
SUMO1 was also significantly reduced in failing hearts. However, the 
levels of UBC9 and SENP1, critical SUMOylating and de-SUMOylating 
enzymes, were unaltered in failing hearts (Fig. 1a, bottom panel). These 
data indicate that the reduction in SERCA2a SUMOylation might be 
primarily due to the reduced level of SUMO1 and not to alterations in 
the level or activity of SUMOylating or de-SUMOylating factors. In 
additional experiments, we observed that the levels of both SUMO1 
and SERCA2a were significantly reduced in a murine model of heart 
failure induced by pressure overload (Fig. 1b), and in a porcine model of 
heart failure induced by volume overload (Fig. 1c). 

To characterize the potential role of SUMO1 during heart failure, we 
restored SUMO1 expression in a murine model of transverse aortic 
constriction (TAC)-induced heart failure. We delivered cardiotropic 
recombinant adeno-associated viruses, serotype 9 (rAAV9) when mice 
developed heart failure (Fig. 1d). Gene transfer of rAAV9-SUMO1 
resulted in a dose-dependent increase in SUMO1 in the myocardium 
one month after gene transfer (Supplementary Fig. 2). After induction 
of heart failure and before gene delivery (two months after TAC), 
cardiac function was markedly decreased in the mice. The TAC mice 
received tail-vein injection of either rAAV9-GFP (encoding green 
fluorescent protein, used as a negative control), rAAV9-SERCA2a 
(positive control) or rAAV9-SUMO1, and were followed for two more 
months. Cardiac function assessment by echocardiography revealed 
substantial improvements after rAAV9-SUMO1 gene delivery (at a 
total of four months after TAC), despite constant pressure overload, 
compared with severe deterioration in the rAAV9-GFP control 
(Fig. 1e, f and Supplementary Table 1). The mice injected with 
rAAV9-SUMO1 recovered cardiac function to the same degree as 
the mice injected with rAAV9-SERCA2a. Furthermore, haemo- 
dynamic analyses showed substantially improved left-ventricular 
function in the SUMO1-treated mice, and stabilization of the ratio of 
heart-weight to body-weight (Supplementary Fig. 3 and Supplemen- 
tary Table 2). The survival rate in the rAAV9-SUMO1-injected mice 
was also significantly extended compared with rAAV9-GFP-injected 
mice with heart failure (Fig. 1g). Consistent with the improvement in 
cardiac performance in rAAV9-SUMO1-injected mice, protein levels 
of SERCA2a were significantly increased in rAAV9-SUMO1-injected 
mice compared with rAAV9-GFP-injected animals, indicating that 
SUMO1 restoration maintained SERCA2a levels during heart failure 
(Fig. 1h). Thus, we provide the first evidence, to our knowledge, that 
cardiac restoration of SUMO1 by rAAV9-mediated gene transfer 
enables long-term improvement of cardiac function. These findings 
provide further evidence that manipulating PTMs of SERCA2a may 
result in a functional benefit. 

SUMOylation usually occurs in a highly conserved recognition motif. 
We found two putative SUMO conjugation sites in human SERCA2a, 
lysines 480 and 585. These residues are perfectly conserved in mouse, rat, 
pig and human SERCA2a (Supplementary Fig. 4). We generated three 
SERCA2a variants with mutations in these lysines. Whereas K480R and 
K585R were SUMOylated at levels indistinguishable from wild-type 
SERCA2a, the K480R/K585R mutant was completely un-SUMOylated 

Figure 1 | Endogenous SUMO1 protein levels are decreased in both 
experimental animal and human heart failure. a, SERCA2a SUMOylation in 
human cardiac tissues (n = 4 per heart failure (HF), n = 3 per non-failing 
control (NF)). Representative immunoblots and protein quantification results 
are shown (n = 5 in each group). S-SERCA2a, SUMOylated SERCA2a; 
GAPDH, glyceraldehyde 3-phosphate dehydrogenase; NS, not significant. 

b, c, Representative immunoblots for SUMO1 and SERCA2a, and 
quantification of protein levels, in models of heart failure. The experimental 
models are TAC-induced heart failure in mice (b; n = 5 TAC, n = 8 sham- 
operated control hearts), and porcine heart failure induced by mitral valve 
regurgitation (MR) (c; n = 3 MR, n = 5 sham-operated). d, Protocol for 
SUMO1 gene therapy in mouse model of heart failure. e, Representative gross 
images of whole hearts (top left panel), haematoxylin and eosin staining results 
(bottom left panel) and two-dimensional guided M-mode images of the left 
ventricle (right panel), from rAAV9-SUMO1-injected mice, or negative 


(Fig. 2a). Because the SUMOylated lysine residues, K480 and K585, 
reside in the nucleotide-binding domain that binds ATP, we predicted 
that SUMOylation might affect the ATPase activity of SERCA2a. As 
expected, the K480R/K585R mutant showed a significantly decreased 
ATPase activity compared to the wild type. SUMO1 co-expression sig- 
nificantly increased the ATPase activity of the wild type but did not affect 
the ATPase activity of K480R/K585R (Fig. 2b). We further tested whether 
SUMOylation affects the ATP-binding affinity of SERCA2a. Co-express- 
ion of SUMO1 markedly increased the ATP-binding affinity of the wild 
type. By contrast, K480R/K585R had considerably less ATP-binding 
affinity, which was not affected by SUMO1 co-expression (Fig. 2c). 
These data indicate that SUMOylation increases the ATPase activity of 
SERCA2a at least partially by enhancing its ATP-binding affinity. 
Because both SERCA2a and SUMO1 levels were reduced in heart 
failure, we studied whether SUMOylation affects SERCA2a stability. 
The estimated half-life of the wild type was 4.9 days, which increased to 
5.9 days when SUMO1 was co-expressed. The estimated half-life of the 
K480R/K585R mutant was significantly reduced compared to wild 
type (Fig. 2d). In endoplasmic-reticulum-associated protein degrada- 
tion, misfolded proteins are usually degraded by 26S proteasomes after 
polyubiquitination. To determine whether the instability of the 

controls injected with rAAV9-SERCA2a or rAAV9-GFP, subjected to TAC. 
f, Effect of rAAV9-SUMO1 treatment on echocardiography indices of left- 
ventricle function measured by fractional shortening (FS) and ejection fraction 
(EF), and left-ventricle end-diastolic diameter measured by internal diameters 
in end-diastole (LVIDd) and end-systole (LVIDs). TAC rAAV9-GFP, n = 10; 
TAC rAAV9-SUMO1, n = 14; TAC rAAV9-SERCA2a, n = 12. g, Survival of 
animals after rAAV9-mediated SUMO1 restoration. The Kaplan-Meier 
method was used to analyse animal lifespan after infection with different 
viruses. rAAV9-GFP, n = 12; rAAV9-SUMO1 or rAAV9-SERCA2a 
(5 × 10¹⁰ vg mouse⁻¹), n = 24 in each group. Of 50 TAC-operated animals, 34 
mice survived and were further divided into groups that received rAAV9-GFP 
(n = 9), rAAV9-SUMO1 (n = 13) or rAAV9-SERCA2a (n = 12). 
h, Immunoblot results from a representative experiment and protein 
quantification (n = 5 per group). All data represent the mean ± s.d. *, P < 0.05 
versus the respective control, as determined by Student's t-test. 
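
The echocardiographic indices named in panel f can be derived from the two M-mode diameters. Fractional shortening follows directly from its definition; for ejection fraction, the caption does not state how volumes were obtained, so the sketch below assumes the Teichholz formula, which is a common choice for M-mode data. Function names and the example diameters are hypothetical.

```python
def fractional_shortening(lvidd_mm, lvids_mm):
    """FS (%) = (LVIDd - LVIDs) / LVIDd * 100."""
    return (lvidd_mm - lvids_mm) / lvidd_mm * 100.0


def teichholz_volume_ml(diameter_mm):
    """Left-ventricular volume (ml) from one internal diameter.

    Assumption: Teichholz formula V = 7 / (2.4 + D) * D^3 with D in cm.
    The caption does not say which volume model was used.
    """
    d_cm = diameter_mm / 10.0
    return 7.0 / (2.4 + d_cm) * d_cm ** 3


def ejection_fraction(lvidd_mm, lvids_mm):
    """EF (%) = (EDV - ESV) / EDV * 100, volumes from the Teichholz formula."""
    edv = teichholz_volume_ml(lvidd_mm)
    esv = teichholz_volume_ml(lvids_mm)
    return (edv - esv) / edv * 100.0


# Hypothetical mouse diameters in mm, for illustration only.
print(fractional_shortening(4.0, 2.0))  # 50.0
print(round(ejection_fraction(4.0, 2.0), 1))
```

Because volume scales roughly with the cube of the diameter, EF computed this way is always larger than FS for the same pair of diameters.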


K480R/K585R mutant involves ubiquitin-dependent degradation, 
we performed ubiquitination assays of SERCA2a in HEK-293 cells. 
SUMO1 overexpression decreased levels of polyubiquitin-conjugated 
wild-type SERCA2a, whereas it did not alter polyubiquitination of the 
K480R/K585R mutant (Supplementary Fig. 5). These data support our 
hypothesis that SUMOylation may compete with ubiquitination on 
SERCA2a, and indicate that SUMOylation increases the stability of 
SERCA2a by preventing 26S-proteasomal degradation of SERCA2a. 
To examine the physiological function of SUMO1, we isolated 
cardiomyocytes from adult mice after TAC-induced heart failure or 
sham operation, and then infected the cells with adenovirus expressing 
either β-galactosidase (Ad-β-gal) or SUMO1 (Ad-SUMO1). When 
infected with Ad-SUMO1, cardiomyocytes from sham-operated mice 
showed significantly enhanced contractility compared to Ad-β-gal-infected 
cardiomyocytes. A more prominent enhancement in contractility 
was observed when failing cardiomyocytes were infected 
with Ad-SUMO1. Ad-SUMO1-infected cardiomyocytes showed 
increased Ca²⁺ amplitude and faster Ca²⁺ decay compared to the 
Ad-β-gal-infected controls. The overall inotropic effect of SUMO1 
overexpression was comparable to that induced by SERCA2a overexpression 
(Fig. 3a). It is possible that SUMO1 overexpression 

Figure 2 | SUMO1 is conjugated to lysines 480 and 585 of SERCA2a and is 
required for SERCA2a function. a, SUMOylation of SERCA2a in vivo. HEK- 
293 cells were co-transfected with plasmids expressing Flag-tagged SUMO1 
and Myc-tagged UBC9 and wild-type or mutant SERCA2a. SUMOylated forms 
of SERCA2a were detected by immunoblot (IB) analysis using anti-SERCA2a 
antibody. IP, immunoprecipitation. b, Ca²⁺-dependent ATPase activity of 
wild-type SERCA2a and the K480R/K585R mutant in the presence and absence 
of additional SUMO1. The data represent three independent experiments, each 
performed in duplicate. c, ATP binding capacity of wild-type and K480R/ 
K585R SERCA2a. Lysates from HEK-293 cells transfected with the indicated 
plasmids were affinity-precipitated with ATP-sepharose and subsequently 
subjected to immunoblot analysis with an anti-SERCA2a antibody. d, Effects of 
SUMO1 overexpression on the stability of wild-type and K480R/K585R 
SERCA2a mutant protein in HEK-293 cells. HEK-293 cells were transfected 
with wild-type or K480R/K585R SERCA2a expression plasmids, together with 
empty or SUMO1-expression plasmids. Cells were treated with cycloheximide 
48 h after transfection to inhibit protein synthesis. Immunoblot analysis was 
performed with the indicated antibodies at different time points. Quantification 
data represent the ratio relative to day 0 (n = 3). All data represent the 
mean ± s.d. *, P < 0.05; **, P < 0.001 versus the respective control, as 
determined using Student’s t-test. 


enhanced cardiomyocyte contractility at least partly by increasing the 
enzymatic activity and stability of SERCA2a. 
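
The half-lives quoted earlier (4.9 and 5.9 days) come from a cycloheximide chase in which band intensity is expressed relative to day 0. The text does not describe the fitting procedure, so the following is only a sketch that assumes first-order (single-exponential) decay and a least-squares line through the origin on log-transformed ratios; the function name is hypothetical and the example series is synthetic, not data from the study.

```python
import math


def estimate_half_life(days, fraction_remaining):
    """Estimate protein half-life from a chase time course.

    Assumes first-order decay, f(t) = exp(-k * t), with f(0) = 1 because
    values are ratios relative to day 0. The slope k is fitted by least
    squares through the origin on ln(f); the half-life is ln(2) / k.
    All fractions must be greater than zero.
    """
    numerator = sum(t * math.log(f) for t, f in zip(days, fraction_remaining))
    denominator = sum(t * t for t in days)
    k = -numerator / denominator
    return math.log(2) / k


# Synthetic series generated from an assumed 4.9-day half-life.
days = [0, 1, 2, 3, 4, 5]
fractions = [0.5 ** (t / 4.9) for t in days]
print(round(estimate_half_life(days, fractions), 2))
```

Forcing the fit through the origin reflects the normalization to day 0; a fit with a free intercept would be a reasonable alternative if the day-0 measurement itself is noisy.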

Given our positive in vitro and gene-delivery results, we then pro- 
ceeded to define the physiological consequences of SUMO1 over- 
expression in vivo. We generated cardiac-specific, Cre/loxP-conditional 
Sumo1-transgenic mice (Supplementary Fig. 6a, b). Along with 
increased SUMO1 levels, the levels of both SERCA2a itself and its 
SUMOylation were significantly induced by tamoxifen administration 
(Supplementary Fig. 6c, d). No cardiac dysfunction was apparent in 
these transgenic animals (Supplementary Table 3). Wild-type and 
transgenic mice were subjected to TAC operations and heart failure 
developed in two months, characterized by a decrease of less than 50% 
in fractional shortening. One month after tamoxifen was administered 
(three months in total after TAC), cardiac function was examined. At 
this point, wild-type mice showed severe failing phenotypes with left 
ventricular dilation and a low ejection fraction. However, tamoxifen- 
induced overexpression of SUMO1 markedly reversed these failing 
phenotypes, resulting in less left ventricular dilation and improved 
fractional shortening and ejection fraction (Fig. 3b, c and Supplemen- 
tary Table 4). Haemodynamic analyses also showed improved left 
ventricular function in the transgenic mice, along with stabilization 
of heart-weight to body-weight ratio (Supplementary Fig. 7 and Sup- 
plementary Table 5). The recovery from cardiac dysfunction induced 
by SUMO1 overexpression was also manifested by the increased survival 
rate of transgenic animals under prolonged pressure overload 
(Fig. 3d). All transgenic animals survived 100 days after tamoxifen 
administration, whereas only seven out of fifteen wild-type animals 
survived at this time point. These findings are consistent with the 
results from the rAAV9 gene-delivery study. 
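
The survival comparison above relies on the Kaplan-Meier estimator. A minimal sketch of that estimator follows; the function name is hypothetical, and the individual death times are invented for illustration because the text reports only the endpoint (seven of fifteen wild-type animals alive at 100 days).

```python
def kaplan_meier(times, events):
    """Kaplan-Meier survival curve.

    times:  follow-up time for each animal (days).
    events: True if death was observed at that time, False if the animal
            was censored (still alive at the end of follow-up).
    Returns a list of (time, survival_probability) at each death time.
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all animals sharing the same time point.
        while i < len(data) and data[i][0] == t:
            deaths += 1 if data[i][1] else 0
            removed += 1
            i += 1
        if deaths:
            survival *= 1.0 - deaths / n_at_risk
            curve.append((t, survival))
        n_at_risk -= removed
    return curve


# HYPOTHETICAL death times (days); only the 7-of-15 endpoint is from the text.
death_days = [20, 35, 40, 55, 60, 72, 80, 95]
times = death_days + [100] * 7
events = [True] * 8 + [False] * 7
curve = kaplan_meier(times, events)
print(curve[-1])
```

With no censoring before day 100, the final estimate reduces to the simple proportion of survivors, 7/15; the estimator only differs from that proportion when animals leave the study early.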

Next, we monitored the expression levels of key regulatory proteins 
involved in Ca²⁺ homeostasis. After TAC, a notable reduction in the 
level of SERCA2a was observed, and levels of sodium/calcium exchanger 
1 (SLC8A1, also known as NCX1) increased compared to sham-operated 
controls (Fig. 3e). A decrease in SERCA2a function is coupled with an 
increase in NCX1 function in failing hearts. The TAC-induced changes 
in the levels of SERCA2a and NCX1 did not occur in transgenic animals. 
In wild-type animals, TAC resulted in a significant reduction of 
SERCA2a ATPase activity, but this was significantly lessened in the 
transgenic animals (Fig. 3f). Taken together, these data indicate that 
SUMO1 overexpression ameliorates cardiac dysfunction induced by 
pressure overload. 

To evaluate the effects of downregulation of SUMO1 in hearts, we 
generated rAAV9 viruses that express SUMO1-directed shRNA 
(rAAV9-shSumo1), or a scrambled sequence (rAAV9-SC). We confirmed 
the efficiency and specificity of the shSumo1 in mice 
(Supplementary Fig. 8a, b). Six weeks after tail-vein injection of viruses, 
cardiac function was evaluated. Hearts from rAAV9-shSumo1-injected 
mice showed left ventricular dilation and functional deterioration 
compared to hearts from rAAV9-SC-injected mice (Fig. 4a, b 
and Supplementary Tables 6 and 7). Injection of an increased dose of 
rAAV9-shSumo1 resulted in more severe cardiac dysfunction 
(Supplementary Fig. 8c, d). Cardiac dysfunction induced by rAAV9-shSumo1 
manifested as sudden death of the mice. All mice receiving 
1 × 10¹¹ viral genomes (vg) of rAAV9-shSumo1 died within 3 weeks. 
The death rates of mice that received 3 × 10¹⁰ vg and 5 × 10¹⁰ vg of 
rAAV9-shSumo1 were slightly higher than the rates in control mice 
that received rAAV9-SC (none of these control mice died) (Fig. 4c). 
SERCA2a protein levels were decreased by approximately 40% in 
rAAV9-shSumo1-injected hearts, and NCX1 levels were slightly elevated, 
although this was not statistically significant. As expected, 
SUMOylation of SERCA2a was also notably blunted (Fig. 4d and 
Supplementary Fig. 8e). These results support the premise that 
SUMO1 is important for the stability and function of SERCA2a. 
Downregulation of SUMO1 also significantly suppressed the ATPase 
activity of SERCA2a (Fig. 4e), and this was further impaired when a 
higher dose of rAAV9-shSumo1 was injected (Supplementary Fig. 8f). 
These data show that SUMO1 is an essential regulator of SERCA2a 
function in the heart. 

We next tested the SUMOylation status of some important cardiac 
transcriptional factors such as GATA binding protein 4 (GATA4) and 
serum response factor (SRF). These factors are involved in cardiac 
pathogenesis and have been reported by other groups to be regulated by 
SUMOylation. The SUMOylation status and protein expression 
level of GATA4 were not altered by SUMO1 gene delivery, whereas 
SUMOylation of SRF was increased (Supplementary Fig. 9). Increased 
SRF SUMOylation may contribute to the activation of certain genes, 
such as SERCA2a and genes encoding contractile proteins, during heart 
failure. Several other transcriptional factors, such as SP1 and SP3, and 
calcium-regulatory signalling pathways, are associated with SERCA2a 
gene expression. Notably, the cleaved form of SRF was detected in 
failing hearts and this was decreased by SUMO1 and SERCA2a gene 
delivery. Recent studies have shown that caspase-3-mediated SRF 
cleavage occurs in heart failure and SUMO1 overexpression blocks 
cell-death signalling induced by both tumour necrosis factor and FAS. 
Our findings and other recent studies indicate that SUMO1 restoration 
increases levels of SUMOylated SRF and may alter apoptotic signalling 
in heart failure. 

To determine the critical link between SUMO1 and SERCA2a in the 
setting of cardiac dysfunction, we knocked down SERCA2a using a len- 
tiviral system carrying a shRNA directed towards Serca2a (shSerca2a). In 
isolated cardiomyocytes from rat and mouse, shSerca2a markedly 
decreased contractile parameters and these were not improved by overexpression 
of SUMO1 (Supplementary Figs 10a, b and 11a). In vivo, rat 
ventricular function was significantly decreased after gene transfer of 
shSerca2a but no improvements were observed after co-infection with 
Ad-Sumo1 (Supplementary Fig. 10c-f and Supplementary Table 8). In 
mice, antecedent overexpression of SUMO1 did not rescue ventricular 
function after lentiviral gene transfer of shSerca2a (Supplementary 
Fig. 11b-e and Supplementary Table 9). Taken together, these four 
experiments show that SERCA2a is critical in modulating SUMO1's 
beneficial effects in the setting of heart failure. However, we cannot 
exclude the possibility that knockdown of SERCA2a has effects on other 
proteins that would not allow SUMO1 to rescue contractile function. 

Figure 3 | SUMO1 overexpression restores TAC-induced cardiac 
dysfunction. a, Bar graphs show the average of the contractility and 
Ca²⁺-transient parameters. Isolated mouse cardiomyocytes were 
infected with the indicated adenoviruses and their 
mechanical properties were measured. Sham 
contractility, n = 143 cells from 6 mice; heart-failure 
contractility, n = 181 cells from 8 mice; 
sham Ca²⁺ transient, n = 65 cells from 4 mice; 
heart-failure Ca²⁺ transient, n = 131 cells from 
6 mice. b, Representative gross images of whole 
hearts (top left panel), haematoxylin and eosin 
staining results (bottom left panel) and two-dimensional 
guided M-mode images of the left 
ventricle (right panel) from Sumo1-transgenic 
mice or wild-type controls subjected to sham-operation 
or TAC. c, Echocardiographic 
measurements, as in Fig. 1f. Sham wild-type, 
n = 12; sham transgenic, n = 10; TAC wild-type, 
n = 14; TAC transgenic, n = 12. d, Kaplan-Meier 
survival curves of wild-type mice (dotted line, 
n = 15) and Sumo1-transgenic mice (solid line, 
n = 14), after TAC operations. e, Representative 
immunoblot analysis showing changes in the levels 
of cardiac proteins in Sumo1-transgenic mice (left 
panel), and their quantification (right panel; n = 4 
for each group). PLB, phospholamban; RyR2, 
ryanodine receptor 2. f, Ca²⁺-dependent ATPase 
activity of SERCA2a in preparations from sham- 
operated wild-type, sham-operated Sumo1- 

We have shown that SERCA2a is SUMOylated at two lysine residues. 
We found that the levels of both SERCA2a itself and SERCA2a 
SUMOylation were significantly reduced in failing hearts. We have 
provided compelling evidence that the reduction of SERCA2a 
SUMOylation is a direct result of reduced SUMO1 levels in failing 
hearts. This reduction in SUMOylation correlated with reduced 
ATPase activity and decreased SERCA2a stability. Moreover, restoration 
of SUMO1 reversed contractile dysfunction in failing hearts. Our 
findings are summarized in Fig. 4f. 

It is possible that SUMOylation may induce a conformational 
change in SERCA2a or may provide an additional interface for 
ATP binding, leading to increased ATPase activity. It is also possible 
that SUMOylation may reciprocally and competitively affect other 
PTMs of SERCA2a and, in particular, may alter acetylation. 
Notably, acetylation of SERCA2a was recently identified in a large-scale 
analysis of the human acetylome in cancer cell lines. We also 
found that SERCA2a is indeed acetylated and that this acetylation is 
more prominent in failing hearts and can be reversed by SIRT1 
deacetylase (Kho et al., unpublished data). Further elucidation of the 
reciprocal or competitive relationship between SUMOylation and 
acetylation of SERCA2a is currently underway. There are precedents 
for this type of SUMOylation-mediated inhibition of protein degradation. 
For example, SUMOylation of axin, a negative regulator of Wnt 
signalling, prevents ubiquitination and thus induces a prolonged axin 
half-life. Similarly, SUMOylation of the RNA helicases p68 and p72 
increases their stability by reducing ubiquitin-proteasome-mediated 
protein degradation. Detailed biochemical analyses are underway to 
identify the ubiquitination sites in SERCA2a. 

Our data show that the amount of SUMO1 is reduced in failing hearts 
from various species, indicating that the level of cellular SUMO1 must be 
precisely maintained and controlled for proper cardiomyocyte 
function. Our finding that restoration of SUMO1 reversed TAC-induced 
heart failure indicates that a reduction in SUMO1 levels is a direct cause 
of contractile dysfunction. Our study was based on strong background 
data that has clearly established impaired SERCA2a as a key molecular 
abnormality in heart failure. The therapeutic effects of SUMO1, with 
near-complete recovery of contractile function, are notable, considering 
that expression of only a single gene is altered. In contrast to the reduction 
of SUMO1 levels in failing hearts, the levels of the SUMOylating and 
de-SUMOylating enzymes UBC9 and SENP1 were unaltered, and did 
not change when shSUMO1 was administered or when SUMO1 levels 
were restored (data not shown). Therefore, the specificity and capacity of 
SUMOylation itself is unlikely to be altered in failing hearts. Instead, the 
supply of SUMO1 itself seems to have the largest impact on 
SUMOylation. In this regard, it is notable that depletion of cellular 
ubiquitin is sufficient to cause neuronal dysfunction and death. 

Figure 4 | Reduction of SUMO1 levels accelerates cardiac dysfunction. 
a, Heart morphology, cardiac function and heart dimensions were examined in 
mice 6 weeks after injection of 5 × 10¹⁰ vg of rAAVs. Representative gross 
images of the hearts (top left panel), haematoxylin and eosin staining results 
(top right panel) and M-mode imaging data (bottom panel) are shown. 
b, Echocardiographic measurements, as in Fig. 1f, for rAAV9-shSC or rAAV9- 
shSumo1-injected mice (5 × 10¹⁰ vg mouse⁻¹, n = 14 per group). c, Survival of 
animals after rAAV9-mediated cardiac knockdown of SUMO1. Kaplan-Meier 
curves are shown for different doses of rAAV9-shSumo1 (n = 14 per group) or 
rAAV9-shSC (5 × 10¹⁰ vg mouse⁻¹, n = 24). Solid line represents rAAV9- 
shSC-injected animals and dotted lines represent rAAV9-shSUMO1-injected 
animals. d, Representative immunoblot analysis for the indicated cardiac 
proteins in samples collected 6 weeks after tail-vein injection of rAAV9-shSC 
(5 × 10¹⁰ vg mouse⁻¹, n = 4) or rAAV9-shSumo1 (5 × 10¹⁰ vg mouse⁻¹, 
n = 7) (left panel). Quantification of GAPDH-corrected immunoblot signals 
(right panel). e, Ca²⁺ dependence of ATPase activity of SERCA2a. ATPase 
activity was examined in preparations from shSC-injected and shSumo1- 
injected hearts (5 × 10¹⁰ vg mouse⁻¹, n = 3 per group). All data represent 
mean ± s.d. *, P < 0.05; **, P < 0.001 versus the respective control, as 
determined using Student’s t-test. f, A working model for the regulation of 
SERCA2a function by SUMOylation. Under basal conditions, SUMOylation 
enhances the stability of SERCA2a and its Ca²⁺ pump function to regulate 
cardiac contractility. However, increased levels of unSUMOylated SERCA2a 
due to low SUMO1 protein pools trigger impaired SERCA2a activity and 
induce cardiac dysfunction under pathophysiological conditions. 


In this study we demonstrate a novel regulatory mechanism 
whereby SUMOylation affects SERCA2a activity and the overall con- 
tractile properties of the heart. This highlights the importance of post- 
translational mechanisms in experimental and human heart failure. In 
addition, the beneficial effects of SUMO1 on cardiac contractility and 
survival indicate that targeting SUMO1 may provide a novel therapeutic
strategy for the treatment of heart failure.


METHODS SUMMARY 

Animals. All mice and rats were housed and treated in accordance with guidelines 
from the NIH and institutional animal care and use committees, and the protocols 
used were approved by the Mount Sinai School of Medicine animal care and use 
committee. 

Human heart samples. Left ventricular samples were obtained from explanted 
human hearts obtained at the time of cardiac transplantation. Non-failing hearts 
(which were used as controls) were obtained from donors who died from neuro- 
logical diseases or motor-vehicle accidents, and who had normal cardiac function 
by echocardiography. The five donors, three males and two females, had a median 
age of 62. The heart-failure patients, three males and two females, had a median
age of 60 and their average ejection fraction was 20 ± 3%.


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS

In vivo SUMOylation assay. To analyse SUMOylation in cells, lipofectamine 2000 
was used to transfect HEK-293 cells with plasmids encoding wild-type or 
SUMOylation-site mutants of SERCA2a, along with Flag-tagged SUMO1 and 
Myc-tagged UBC9. The cells were lysed by sonication in ice-cold lysis buffer 
(50 mM Tris-HCl (pH 8.0), 150 mM NaCl, 0.1% Triton X-100, 10 mM EDTA, com- 
plete protease inhibitor (1 tablet per 10 ml, Roche) and protein phosphatase inhibitor 
cocktail (Sigma)) containing 20 mM N-ethylmaleimide. Lysates were cleared by
centrifugation at 30,000g for 20 min. Cell lysates were then immunoprecipitated 
overnight at 4°C with a Flag-specific affinity matrix gel (Sigma), after which the 
immunoprecipitates were washed in cold lysis buffer. The immunocomplexes were 
then resolved by SDS polyacrylamide gel electrophoresis (SDS-PAGE) and immu- 
noblot analysis was performed with a SERCA2a-specific antibody. 

Fresh tissue extracts were prepared in lysis buffer for in vivo SUMOylation 
assays. Hearts from each experimental and control group were frozen in liquid 
nitrogen. The frozen tissues were crushed and homogenized in lysis buffer as 
described above using the MP homogenate system (FastPrep homogenizer). 
The insoluble portion was removed by centrifugation at 30,000g for 20 min. The 
extracts were incubated with anti-SUMO1 agarose resin with agitation overnight.
The SUMO-conjugated forms were detected by immunoblot analysis with specific 
primary antibodies. 

SERCA2a activity assay. SERCA2a activity was determined using an assay based 
on pyruvate/NADH-coupled reactions, as previously described”. The activity of 
the Ca²⁺-ATPase was calculated as follows: Δabsorbance/(6.22 × protein × time)
(in nmol ATP per mg protein × min). All assays were performed in triplicate.
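
The calculation above can be sketched in a few lines of Python. This is only an illustrative reading of the stated formula, not the authors' analysis code. It assumes that 6.22 is the millimolar extinction coefficient of NADH (mM⁻¹ cm⁻¹, conventionally measured at 340 nm), that one NADH is oxidized per ATP hydrolysed in the coupled reaction, and that the path length and reaction volume (the hypothetical parameters `path_cm` and `volume_ml`) default to 1 cm and 1 ml.

```python
# Minimal sketch of a pyruvate/NADH-coupled ATPase activity calculation.
# Assumptions (not stated in the source): path length and reaction volume
# default to 1 cm and 1 ml; one NADH is consumed per ATP hydrolysed.

NADH_EXTINCTION_MM = 6.22  # mM^-1 cm^-1, extinction coefficient of NADH


def atpase_activity(delta_absorbance, protein_mg, time_min,
                    volume_ml=1.0, path_cm=1.0):
    """Return activity in nmol ATP per mg protein per min."""
    if protein_mg <= 0 or time_min <= 0:
        raise ValueError("protein and time must be positive")
    # Beer-Lambert law: concentration change of NADH in mM (= umol per ml).
    nadh_mm = delta_absorbance / (NADH_EXTINCTION_MM * path_cm)
    # Convert to nmol in the reaction volume (umol/ml * ml * 1000).
    nmol_atp = nadh_mm * volume_ml * 1000.0
    return nmol_atp / (protein_mg * time_min)


if __name__ == "__main__":
    # Hypothetical reading: absorbance drop of 0.622 over 10 min, 0.1 mg protein.
    print(round(atpase_activity(0.622, 0.1, 10.0), 3))
```

Repeating the call for each triplicate and averaging the results would mirror the triplicate design described above.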

Generation of conditional Sumo1-transgenic mice. The αMHC-flox-mouse
SUMO1 transgene was subcloned into the pML2G vector (gift from Y. Wang), 
which encodes GFP cDNA between two loxP sites. The construct was microin- 
jected into fertilized eggs from B6C3 mice, and transgenic integration was con- 
firmed by PCR (mouse genetics shared research facility, Mount Sinai School of 
Medicine). 

Antibodies. Antibodies against the following proteins were used for immunoblot- 
ting: SERCA2a (21st Century Biochemicals), GAPDH (Sigma), Myc (Sigma), Flag 
(Sigma), SUMO1 (Cell Signaling), SUMO1 agarose resin (Santa Cruz Biotech), 
SUMO2/3 (MBL), UBC9 (Boston Biochem), SENP1 (Pierce), PLN (Badrilla), 
RyR2 (Badrilla), NCX1 (Chemicon), GATA4 (Millipore), SRF (Millipore), anti- 
rabbit-HRP and anti-mouse-HRP secondary antibodies (Sigma). 

Primers for mutagenesis. The following primer sets were used for mutagenesis:
K480R_sense, 5'-TCA GTC ATT AAA CAG CTG ATG AGA AAG GAA TTC
ACT CTA GAG-3'; K480R_antisense, 5'-CTC TAG AGT GAA TTC CTT TCT
CAT CAG CTG TTT AAT GAC TGA-3'; K585R_sense, 5'-GAG GAC TCT GCC
AAC TTT ATT AGA TAT GAG ACC AAT CTG ACC-3'; K585R_antisense, 5'-
GGT CAG ATT GGT CTC ATA TCT AAT AAA GTT GGC AGA GTC CTC-3'.
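
Because each antisense primer should be the reverse complement of its sense partner, transcription errors in the sequences listed above can be caught with a short check. The sketch below is illustrative only; the dictionary layout and function names are hypothetical, and the sequences are copied from the text.

```python
# Check that each antisense mutagenesis primer is the reverse complement
# of its sense partner. Sequences are taken from the primer list above.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence, ignoring spaces."""
    cleaned = seq.replace(" ", "").upper()
    return cleaned.translate(COMPLEMENT)[::-1]


PRIMERS = {
    "K480R": (
        "TCA GTC ATT AAA CAG CTG ATG AGA AAG GAA TTC ACT CTA GAG",
        "CTC TAG AGT GAA TTC CTT TCT CAT CAG CTG TTT AAT GAC TGA",
    ),
    "K585R": (
        "GAG GAC TCT GCC AAC TTT ATT AGA TAT GAG ACC AAT CTG ACC",
        "GGT CAG ATT GGT CTC ATA TCT AAT AAA GTT GGC AGA GTC CTC",
    ),
}


def is_complementary_pair(sense, antisense):
    """True if antisense equals the reverse complement of sense."""
    return reverse_complement(sense) == antisense.replace(" ", "").upper()


if __name__ == "__main__":
    for name, (sense, antisense) in PRIMERS.items():
        print(name, is_complementary_pair(sense, antisense))
```

Both pairs pass this check, and in each sense primer the eighth codon is AGA, which encodes arginine, consistent with a lysine-to-arginine substitution.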

Proteomic analysis. Solubilized immunocomplexes were combined with a
rehydration buffer (9 M urea, 2 M thiourea, 4 M CHAPS buffer, 16 mM dithiothreitol,
2% w/v pharmalyte 3-11 and trace amounts of bromophenol blue) to a final
volume of 340 µl, and then rehydrated for 16 h. After rehydration, strips were
focused at 60 kVh at 20 °C (IPGphor III, GE Healthcare). When isoelectric focus- 
ing was complete, the strips were equilibrated, separated on 12.5% SDS-PAGE gels 
and visualized by silver staining using the Plus One silver staining kit (GE 
Healthcare), with minor modifications to ensure compatibility with subsequent 
mass spectrometry analysis. Stained two-dimensional gel images were scanned 
(ImageScanner II, GE Healthcare) and analysed by PDquest 8.1 software system 
(Bio-Rad). Spots were excised from the gels by spot picker and prepared for mass 
spectrometry as previously described”. 

Mass spectrometry analysis. Identification of gel spots was accomplished by ESI/ 
LC/MS/MS system (Thermo-Scientific). Full scan spectra were recorded in posi- 
tive mode over the mass range 350-2000 Da. MS/MS data were automatically 
acquired on the two most intense precursor ions in each full scan spectrum, and 
were interpreted using Xcalibur 2.0 and Bioworks 3.2 software. 

Contractility measurement. The mechanical properties of isolated ventricular 
cardiomyocytes were assessed using a video-based edge detection system 
(IonOptix) as previously described”*. 

Adenoviruses. Adenoviruses encoding SERCA2a and SUMO1 were generated
using the pAdEasy XL adenoviral vector system (Stratagene) according to the 
manufacturer’s protocols. 

rAAV vector production and purification. rAAV9-SERCA2a, rAAV9- 
shSumo1 and rAAV9-SC were produced using the two-plasmids protocol as 
previously described’’ with the following modifications: HEK-293T cells 
(ATCC) were grown in triple flasks for 24h (DMEM, 10% fetal bovine serum) 
before adding the calcium phosphate precipitate. After 72 h, the virus was purified 
from benzonase-treated crude cell lysates over an iodixanol density gradient 
(Optiprep, Greiner Bio-One Inc.), followed by heparin-agarose type I affinity
chromatography (Sigma). Finally, viruses were concentrated and formulated into
lactated Ringer’s solution (Baxter Healthcare Corporation) using Vivaspin 20
centrifugal concentrators 50K MWCO (Vivascience Inc.), and stored at −80 °C.

Lentivirus. The lentiviral plasmids encoding shRNAs for Serca2a were previously
described by us**. The Serca2a shRNA construct is the following sequence; the 
underlined segment indicates the loop: 5'-GATCCGACTTACTAGTTAGAA 
TTTGGCTAAGAGCAAATTCTAACTAGTAAGTCTTTTTTGGAATTAAT-3’. 
We used third-generation lentiviral systems to generate the vectors”. 

Transverse aortic constriction. Mice underwent TAC using a supraclavicular 
construction model as previously described*®. TAC or sham surgery was per- 
formed in 8-week-old male mice (body weight 18-22 g). Mice were anesthetized 
with intraperitoneal ketamine and placed on a ventilator. A longitudinal cut of 
2-3 mm was made in the proximal portion of the sternum, allowing visualization 
of the aortic arch. The transverse aortic arch was ligated between the innominate 
and left common carotid arteries with an overlying 27-gauge needle. The needle 
was immediately removed leaving a discrete region of constriction. The sham 
group underwent a similar procedure without ligation. 

Gene transfer in vivo. Adult Sprague-Dawley rats (body weight 240-260 g) 
underwent gene transfer via aortic cross-clamping as previously described”. 
After dissection of the aorta and pulmonary artery, lentiviral shSerca2a with or 
without adenoviral SUMO1 were injected into the left ventricular cavity through a
22G catheter while the aorta and pulmonary artery were cross-clamped for 50 s. In 
sham-operated animals, normal saline was injected into the left ventricular cavity 
while the aorta and pulmonary artery were cross-clamped for 50 s. After
cross-clamping was released, the chest was closed.

Echocardiography. Transthoracic echocardiography was performed using a 
Vivid 7000 (GE Healthcare) equipped with a H13L transducer (14 MHz). Two- 
dimensional and M-mode images were obtained in the short-axis view. The heart 
rate, left ventricular end-diastolic internal diameter and left ventricular end-sys- 
tolic internal diameter were measured in at least three repeated cardiac cycles. The 
ejection fraction and fractional shortening were then calculated. 
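
The text does not say which formulas were used for these two indices. As a hedged illustration, the sketch below uses the standard definition of fractional shortening and, as an assumption, the simple cubed-diameter approximation of ventricular volume for ejection fraction. The function names and example diameters are hypothetical.

```python
# Illustrative M-mode calculations from left ventricular internal diameters.
# LVEDD: end-diastolic diameter; LVESD: end-systolic diameter (same units).
# The cube approximation for ejection fraction is an assumption; the source
# does not state which volume formula was applied.


def fractional_shortening(lvedd, lvesd):
    """Fractional shortening in percent."""
    if lvedd <= 0:
        raise ValueError("LVEDD must be positive")
    return 100.0 * (lvedd - lvesd) / lvedd


def ejection_fraction_cube(lvedd, lvesd):
    """Ejection fraction in percent, approximating volume as diameter cubed."""
    if lvedd <= 0:
        raise ValueError("LVEDD must be positive")
    return 100.0 * (lvedd ** 3 - lvesd ** 3) / lvedd ** 3


def mean_over_cycles(values):
    """Average a measurement over repeated cardiac cycles."""
    return sum(values) / len(values)


if __name__ == "__main__":
    # Hypothetical diameters (mm) from three cardiac cycles.
    lvedd = mean_over_cycles([4.0, 4.1, 3.9])
    lvesd = mean_over_cycles([2.0, 2.1, 1.9])
    print(fractional_shortening(lvedd, lvesd))
    print(ejection_fraction_cube(lvedd, lvesd))
```

Averaging the diameters over at least three cycles before computing the indices follows the measurement scheme described above.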

Haemodynamics. Haemodynamic measurements were performed using a 1.2 Fr
pressure-volume conductance catheter (Scisense). Pressure-volume loop analysis 
was performed as previously described*’. Mice were injected intraperitoneally with 
urethane (1 g kg⁻¹), etomidate (10 mg kg⁻¹) and morphine (1 mg kg⁻¹) and
mechanically ventilated with 7 µl g⁻¹ stroke volume at 125 respirations min⁻¹.
The chest was opened to expose the heart for an apical stab approach. To deter- 
mine absolute ventricular volumes via admittance technology, myocardial and 
blood conductance were obtained before pressure-volume catheter placement in 
the left ventricle’. The inferior vena cava was transiently occluded to reduce 
ventricular pre-load to obtain load-independent pressure-volume relationships. 
Haemodynamic measurements were acquired and analysed using IOX software 
(EMKAtech). 

Histological analysis. Hearts were prepared in Tissue-Tek OCT compound 
(Sakura Finetechnical) and sectioned into 6-µm slices (Microm HM560 Cryo-star,
Thermo Scientific). The sections were stained with haematoxylin and eosin. 

Statistical analysis. Statistical analyses were performed using Student’s t-test.
Significant differences are indicated *, P < 0.05 or **, P < 0.001. Data in the
figures represent mean ± s.d.
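
As a rough illustration of the summary statistics named above, the following sketch computes mean ± s.d. and the two-sample Student's t statistic with pooled variance, using only the Python standard library. It is not the authors' analysis code, and the data are hypothetical. Converting the t statistic to a P value requires the t distribution, for example from a statistics package or a printed table, and is left out here.

```python
# Mean, sample standard deviation and pooled-variance Student's t statistic.
# Example data are hypothetical.
from math import sqrt
from statistics import mean, stdev, variance


def mean_sd(values):
    """Return (mean, sample standard deviation)."""
    return mean(values), stdev(values)


def student_t(group_a, group_b):
    """Return (t statistic, degrees of freedom) for two independent groups."""
    na, nb = len(group_a), len(group_b)
    df = na + nb - 2
    pooled_var = ((na - 1) * variance(group_a) + (nb - 1) * variance(group_b)) / df
    t = (mean(group_a) - mean(group_b)) / sqrt(pooled_var * (1 / na + 1 / nb))
    return t, df


def significance_label(p_value):
    """Map a P value to the asterisk convention used in the figures."""
    if p_value < 0.001:
        return "**"
    if p_value < 0.05:
        return "*"
    return ""


if __name__ == "__main__":
    control = [1.0, 2.0, 3.0]
    treated = [2.0, 3.0, 4.0]
    print(mean_sd(control))
    print(student_t(control, treated))
```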

The role of Tet3 DNA dioxygenase in epigenetic
reprogramming by oocytes 

Sperm and eggs carry distinctive epigenetic modifications that are
adjusted by reprogramming after fertilization’. The paternal 
genome in a zygote undergoes active DNA demethylation before 
the first mitosis”*. The biological significance and mechanisms of 
this paternal epigenome remodelling have remained unclear*. Here 
we report that, within mouse zygotes, oxidation of 5-methylcytosine 
(5mC) occurs on the paternal genome, changing 5mC into
5-hydroxymethylcytosine (5hmC). Furthermore, we demonstrate that
the dioxygenase Tet3 (ref. 5) is enriched specifically in the male 
pronucleus. In Tet3-deficient zygotes from conditional knockout 
mice, paternal-genome conversion of 5mC into 5hmC fails to occur 
and the level of 5mC remains constant. Deficiency of Tet3 also 
impedes the demethylation process of the paternal Oct4 and 
Nanog genes and delays the subsequent activation of a paternally 
derived Oct4 transgene in early embryos. Female mice depleted of 
Tet3 in the germ line show severely reduced fecundity and their 
heterozygous mutant offspring lacking maternal Tet3 suffer an 
increased incidence of developmental failure. Oocytes lacking 
Tet3 also seem to have a reduced ability to reprogram the injected 
nuclei from somatic cells. Therefore, Tet3-mediated DNA hydro- 
xylation is involved in epigenetic reprogramming of the zygotic 
paternal DNA following natural fertilization and may also contri- 
bute to somatic cell nuclear reprogramming during animal cloning. 

To investigate whether loss of DNA methylation in the male pro- 
nucleus coincides with oxidation of 5mC to 5hmC, a recently reported 
type of modification of mammalian DNA®®, we performed immuno- 
staining of mouse zygotes using an antibody specifically recognizing 
5hmC (Supplementary Fig. 1). We found that the 5hmC signal 
increased markedly in the paternal pronucleus around the pronuclear 
stage PN3 when the paternal pronucleus became larger than the 
maternal pronucleus (Fig. 1a, b). By contrast, the 5mC signal became 
markedly weaker in the male pronucleus from the PN3 stage whereas 
there was no clear change in the female pronucleus (Supplementary 
Fig. 2), as reported previously*. The inverse correlation between the 
5hmC and 5mC signals in the two parental genomes seemed to persist 
beyond the zygotic stage (Supplementary Fig. 3). Therefore, 5mC 
oxidation in the male pronucleus coincides with the loss of methyla- 
tion in the early mouse embryo. A similar observation has recently 
been reported in two independent studies”*. 

Next, we examined the expression of Tet enzymes that can catalyse 
the oxidation of 5mC in DNA’. The Tet3 mRNA was specifically 
detected in oocytes and zygotes (Supplementary Fig. 4). At the zygotic 
stage, the Tet3 protein was concentrated in the male pronucleus, but 
localized to the cytoplasm at other pre-implantation stages (Fig. 1c and
Supplementary Fig. 5). The unique expression pattern of Tet3 suggests 
its possible role in modifying the zygotic paternal genome. 


To study the biological function of Tet3 in mouse, we generated a 
conditional knockout allele abolishing its catalytic activity (Supplemen- 
tary Fig. 6a). Because homozygous mutation led to neonatal lethality, 
we achieved germ-line-specific deletion of Tet3 from primordial germ 
cells (PGCs) in [Tet3f/−, TNAP-Cre] conditional knockout (CKO)
mice. 

Female CKO mice were normal in growth and morphology. 
Although they displayed much reduced fecundity (see below), they 
gave birth to heterozygous offspring when crossed with wild-type 
males (Supplementary Fig. 6b). The deletion of Tet3 in oocytes and 
zygotes was confirmed by immunostaining and PCR with reverse 
transcription assays (Fig. 2a and Supplementary Fig. 6c). Strikingly, 
no 5hmC signal could be detected and the 5mC signal intensity did not 
decline in the late male pronuclei of zygotes collected from the CKO 
females mated with wild-type males (Fig. 2b and Supplementary Fig. 
7). In contrast, deletion of Tet3 from the male germ cells did not seem 
to affect the change in 5hmC and 5mC. Therefore, the loss of 5mC in 
the paternal genome in developing zygotes is caused by its conversion 
to 5hmC and the maternal Tet3 is required for this conversion. 

We then assessed the role of Tet3 in demethylation of specific 
sequences in the male pronucleus. Line1 transposons are known to 
be actively demethylated in zygotes'*". Comparison of the methyla- 
tion level of male pronuclear DNA from Tet3-deficient zygotes at the 
PN3-4 stages with that of wild-type zygotes showed that the process of 
active DNA demethylation was impeded by Tet3 deletion (Fig. 2c, 
upper panel). This finding also indicates that 5hmC serves as an inter- 
mediate between 5mC and unmethylated C, although bisulphite ana- 
lysis cannot distinguish 5hmC from 5mC'*"’. To confirm this, we 
assayed for 5mC and 5hmC on Line1 elements in the paternal DNA
by MeDIP and hMeDIP (methylated and hydroxymethylated DNA 
immunoprecipitation). 5hmC was indeed present at Line1 sequences 
in wild-type zygotic male pronuclei at a significantly higher level com- 
pared to sperm, whereas the 5mC level was markedly lower than that of 
sperm (Supplementary Fig. 8). In the analysis of male pronuclear DNA 
from zygotes lacking Tet3, enrichment of 5hmC-containing Line1 
elements was significantly decreased whereas enrichment of 5mC- 
containing elements was comparable with sperm. These results 
strengthen the conclusion that 5mC oxidation does occur at Line1
sequences in male pronuclei. 
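
Bisulphite sequencing results of this kind are usually summarized as the percentage of methylated CpG sites across all sequenced clones. The sketch below shows that arithmetic under simple assumptions: each clone is a list of 0/1 calls (1 for a methylated CpG), and the example data are invented for illustration rather than taken from the figures. As noted above, bisulphite analysis does not distinguish 5hmC from 5mC, so a call of 1 covers both.

```python
# Percentage of methylated CpG sites from bisulphite-sequenced clones.
# Each clone is a list of 0/1 calls; 1 means the CpG read as methylated
# (5mC or 5hmC, which bisulphite analysis cannot tell apart).


def percent_methylated(clones):
    """Return the percentage of methylated CpG calls across all clones."""
    calls = [site for clone in clones for site in clone]
    if not calls:
        raise ValueError("no CpG sites supplied")
    return 100.0 * sum(1 for site in calls if site) / len(calls)


if __name__ == "__main__":
    # Hypothetical clones, four CpG sites each.
    wild_type = [[1, 0, 0, 0], [0, 0, 0, 0]]
    tet3_deficient = [[1, 1, 0, 1], [1, 1, 1, 0]]
    print(percent_methylated(wild_type))
    print(percent_methylated(tet3_deficient))
```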

Embryonic stem cell marker genes, such as Oct4 and Nanog, are 
methylated during male germ cell development and their demethyla- 
tion occurs in the early embryo'*"*. In normal zygotes, the Oct4 gene 
of male pronuclei had undergone substantial demethylation by the 
PN3-4 pronuclear stages, but this process was markedly hampered 
in Tet3-null zygotes (Fig. 2c and Supplementary Fig. 9). Moreover, 
Tet3 deletion almost completely blocked demethylation at two other 
paternally methylated genes, Nanog and Lemd1 (ref. 17), which
retained hypermethylation similar to that observed in sperm
(Supplementary Fig. 9). To assess the significance of paternal demethylation
on gene expression, Tet3-null oocytes from the CKO females were
fertilized by intracytoplasmic injection (ICSI) of wild-type sperm
carrying the enhanced green fluorescent protein (EGFP) reporter gene
under the control of the Oct4 promoter, and expression of EGFP was
monitored in cultured embryos. Compared with wild-type embryos,
the mutant embryos derived from oocytes lacking Tet3 showed
significantly weaker EGFP expression at the 8-cell and morula stages
(Fig. 2d and Supplementary Fig. 10). Based on these global and
sequence-specific analyses of 5mC and 5hmC, along with the reporter
gene assay, we conclude that Tet3-mediated 5mC oxidation contributes
to the demethylation in the zygotic paternal genome and gene
activation in the early embryo.

Figure 1 | Specific oxidation of methylcytosine and Tet3 distribution in the
zygotic male pronucleus. a, Immunofluorescent images of 5hmC (green) and
5mC (red) staining, and overlaid phase contrast images. The pronuclear (PN)
stages are indicated. Male and female symbols indicate male and female
pronucleus, respectively. PB, polar body. Scale bar, 25 µm. b, Quantification of
the relative levels of 5hmC and 5mC in male pronuclei in zygotes. Each data
point is based on the level of the 5hmC or 5mC signal relative to the DAPI
staining intensity of the same pronucleus. Error bars indicate s.e.m. Number of
zygotes analysed for each stage: PN0, 19; PN1, 10; PN2, 8; PN3, 17; PN4, 17;
PN5, 15. c, Preferential staining of Tet3 protein in the male pronucleus. DNA
was stained with DAPI. Control staining shows Dnmt1 in the cortical
cytoplasm and Dnmt3a (red) in both pronuclei. The nucleolus had no staining
signal.

We next investigated whether removing Tet3 from oocytes might
compromise embryonic development. We first confirmed that
early deletion of Tet3 from the PGC stage did not affect epigenetic
reprogramming in the embryonic germ cells, oocyte development,
maturation and fertilization (Supplementary Figs 11-13 and
Supplementary Table 1). Male germ cell development and sperm DNA
methylation were not affected either (Supplementary Fig. 14).
However, fecundity of the female CKO mice was significantly lower
in terms of the frequency of successful pregnancy per mating and the
litter size (Fig. 3a and Supplementary Table 2).

Deletion of maternal Tet3 did not seem to affect the pre-implanta- 
tion development as heterozygous zygotes collected from CKO 
females mated with wild-type males developed to blastocysts in vitro 
normally (Supplementary Table 3). We then examined the effect of 
maternal Tet3 deletion on prenatal development by transplantation of 
2-cell embryos into oviducts of pseudo-pregnant females. Whereas the 
transferred embryos lacking maternal Tet3 implanted normally, they 
showed a much reduced rate of full-term development (Supplementary
Table 4). Dissection of transferred [Tet3 Mat−/Pat+] mutant
embryos in the pregnant wild-type foster mothers revealed a high 
frequency of degeneration and the appearance of morphological 
abnormalities, starting from midgestation (Fig. 3b and Supplemen- 
tary Table 4). Moreover, deletion of Tet3 later from growing oocytes 


29 SEPTEMBER 2011 | VOL 477 | NATURE | 607 


©2011 Macmillan Publishers Limited. All rights reserved 



using Zp3-Cre also led to failure in zygotic 5hmC generation, retention 
of paternal 5mC, impaired demethylation at Line1 and Oct4, and
compromised embryonic development (Supplementary Fig. 15). 
Therefore, the lack of maternal Tet3 blocks paternal genome reprogramming
and causes markedly increased developmental failure of the
embryo. 

Somatic cell nuclei injected into eggs undergo profound epigenetic 
reprogramming, including DNA demethylation. The cytoplasm of 
germinal vesicle (GV) oocytes’*, metaphase II (MII) oocytes’, 
zygotes” and 2-cell embryos”' has been shown to possess reprogram- 
ming activity. The existence of the Tet3 protein across these stages 
indicated that Tet3 might be one of the cytoplasmic factors contributing
to the reprogramming activity. We therefore tested whether
Tet3-mediated hydroxylation is part of the reprogramming process in 
somatic cell nuclear transfer (SCNT). Remarkably, following activa- 
tion of nuclear transfer (NT) oocytes reconstructed after injection of 
somatic nucleus into oocyte with or without enucleation (intact 
oocyte), the Tet3 protein originating from the oocyte cytoplasm 
became concentrated in the pseudo-pronucleus (PPN) formed from 
the transferred somatic nucleus, but not in the female pronucleus 
derived from the spindle-chromosome-complex that existed in 
oocytes (Fig. 4a). Significantly, the PPN formed in the NT embryos 
from wild-type intact or enucleated oocytes underwent 5mC oxida- 
tion whereas this modification did not occur in embryos derived from 
Tet3-null oocytes (Fig. 4b). Whereas substantial demethylation was 
detected at the Oct4 promoter in the PPN of Tet3-proficient NT 
embryos, the somatic hypermethylation persisted when the oocyte 
Tet3 was deleted (Fig. 4c). To investigate the role of Tet3-mediated 
DNA oxidation in the activation of pluripotency genes, we used 
donor somatic cells carrying the Oct4-EGFP transgene. Compared 
with embryos from wild-type oocytes, the EGFP signal was signifi- 
cantly lower in embryos derived from the Tet3-null oocytes at the 



Figure 2 | The role of Tet3 in 5mC oxidation,
demethylation of paternal DNA, and activation
of the paternal Oct4 allele. a, Loss of the Tet3
protein in heterozygous zygotes (Mat−/Pat+)
obtained from CKO females mated with wild-type
males. PN3 zygotes were stained with an anti-Tet3
antibody raised against the deleted region. Scale
bar, 25 µm. b, 5hmC (green) and 5mC (red)
immunostaining in wild-type (+/+) and
maternally (Mat−/Pat+) or paternally (Mat+/Pat−)
Tet3-deficient zygotes at the PN5 stage.
c, Methylation analysis of Line1 and Oct4 (PE, the
proximal enhancer region) in male pronuclei
isolated from wild-type (+/+) and Tet3-deficient
(Mat−/Pat+) zygotes. Open and filled circles
represent unmethylated and methylated CpG sites,
respectively. Percentage of methylated CpGs is
indicated. d, Paternal Oct4 activation in wild-type
embryos (+/+) and embryos lacking maternal
Tet3 (Mat−/Pat+). Embryos were derived from
ICSI using sperm carrying the Oct4-EGFP
transgene and the EGFP signal was quantified,
relative to the level in a 2-cell blastomere. Number
of embryos analysed at each stage, +/+: 2-cell, 13;
4-cell, 9; 8-cell, 9; morula, 12; blastocyst, 7.
Mat−/Pat+: 2-cell, 11; 4-cell, 9; 8-cell, 8; morula, 8;
blastocyst, 5. Error bars indicate s.e.m. **P < 0.01
and ***P < 0.001.

Figure 3 | Maternal Tet3 deficiency compromises embryonic development.
a, Reduced efficiency of productive mating and litter size in females (f/−) with
germline Tet3 deficiency. Error bars indicate s.e.m. [Tet3f/+, TNAP-Cre] mice,
n = 9; [Tet3f/−, TNAP-Cre] mice, n = 7. b, Developmental failure among
embryos lacking maternal Tet3. Pregnant foster females that had received wild-type
(+/+) and mutant (Mat−/Pat+) 2-cell embryos were dissected at
different stages of gestation. Numbers in brackets indicate proportion of
embryonic day 8.5 (E8.5) deciduas and E11.5-18.5 embryos showing a smaller
size and morphological abnormalities. The arrow indicates degenerated
conceptuses obtained. Detailed data are presented in Supplementary Table 4.

Figure 4 | Tet3 contributes to reprogram the somatic nucleus transferred
into oocytes. a, Tet3 enrichment in the pseudo-pronucleus (PPN) formed in
embryos reconstructed by transplantation of cumulus somatic nuclei into an
intact (top row) or enucleated oocyte (middle row). The enrichment did not
occur in the female pronucleus in intact oocytes (top row) and in the two
pronuclei of parthenogenetic embryos (bottom row). Scale bar, 25 µm.
b, Impaired 5mC oxidation in PPN of 1-cell NT embryos derived from injection
of cumulus nuclei into intact or enucleated oocytes lacking Tet3. Embryos
around 10 h post activation (hpa) were stained with anti-5hmC (green) and


8-cell and morula stages (Fig. 4d and Supplementary Fig. 16a). 
Consistently, the Oct4 mRNA level was relatively low in 8-cell embryos 
from Tet3-null oocytes (Supplementary Fig. 16b). Therefore, defi- 
ciency in oocyte Tet3 could cause weakened or delayed activation of 
the somatic Oct4 in NT embryos. 

We have obtained substantial evidence that oxidation of 5mC in the 
paternal genome in fertilized eggs by Tet3 initiates DNA demethyla- 
tion and facilitates the activation of the paternal copy of early embry- 
onic genes, thus contributing to the establishment of biparental 
totipotency in the early embryo by counteracting the silencing function 
of 5mC. Blocking oxidation of 5mC by Tet3 deletion affects paternal 
gene activation, leading to reduced developmental fitness and fetal 
survival. It remains to be determined whether the developmental 
failures could be caused by haploinsufficiency” for the genes affected 
in the early embryo. 

The involvement of Tet3 in somatic Oct4 activation indicates that 
Tet-mediated 5mC oxidation contributes to epigenetic reprogram- 
ming of the donor nuclear DNA in SCNT. Reprogramming in 
SCNT might thus share a common mechanism with paternal genome 
remodelling in fertilized eggs. Further investigations are needed to 
reveal the signals regulating Tet3 and the events subsequent to 
5hmC formation in both fertilized and cloned embryos. 
anti-5mC (red) antibodies. c, Impaired Oct4 demethylation in NT embryos
derived from Tet3-null oocytes. More than 20 NT embryos derived from
enucleated wild-type (WT) or Tet3-null (KO) oocytes were collected 6-8 hpa
for DNA methylation analysis. d, Weakened activation of the somatic Oct4-
EGFP reporter in cultured NT embryos from Tet3-null oocytes. The EGFP
intensities were relative to the level in a 4-cell blastomere. Error bars indicate
s.e.m. Number of embryos analysed for each stage, WT: 4-cell, 11; 8-cell, 13;
morula, 9; blastocyst, 7. KO: 4-cell, 8; 8-cell, 7; morula, 6; blastocyst, 7.
**P < 0.01.


METHODS SUMMARY 

Preparation of anti-5hmC antibody. To prepare anti-5hmC antibody,
5-hydroxymethylcytidine 5'-monophosphate (5hmCMP) hapten was synthesized 
from cytidine 5’-monophosphate (CMP) using MilA hydroxymethylase**. The 
ribonucleoside was then conjugated to bovine serum albumin (BSA) as previously 
described” and used to immunize rabbits. The antibody was affinity-purified from 
antiserum with 5hmCMP-BSA conjugate coupled to agarose beads.

Immunostaining for 5hmC in fertilized oocytes. Immunofluorescence
detection of 5hmC in zygotes followed the procedure for 5mC¹¹. Fluorescent images
were acquired at 2-µm Z-axis intervals using a confocal microscope (Leica TCS
SP5 II) and their signal intensity was determined. 

Gene targeting. A Tet3 targeting vector was electroporated into 129Sv ES cells for 
homologous recombination. The floxed region contains exons 8-9, which includes 
the coding region for the conserved Fe²⁺-binding motif of dioxygenases. To
inactivate Tet3 in the germ line, we crossed the mice carrying a floxed allele with 
TNAP-Cre knock-in mice on a 129 genetic background”. The conditional knock- 
out mice were on C57BL/6J-129Sv genetic background. 

Zygote collection and staging. Female mice (4 to 8 weeks old) were injected with
pregnant mare’s serum gonadotropin and human chorionic gonadotropin and then
mated with wild-type males. Zygotes were harvested and the PN stages of individual
zygotes were classified according to ref. 11, by taking into account the
pronuclear morphology and the presence of 5mC signal. 

Observation of Oct4-EGFP activation. To examine the role of Tet3 in paternal 
gene activation, MII oocytes collected from wild-type and Tet3 CKO females were 
fertilized by ICSI with sperm from transgenic mice (tg/tg) carrying the Oct4-EGFP
transgene” and the resulting embryos were cultured to observe EGFP expression. 
To examine the role of Tet3 in somatic gene activation in SCNT, we monitored the 
Oct4-EGFP transgene expression from the donor cell DNA in NT embryos, which 
were derived after injection of a cumulus nucleus into a wild-type or a Tet3-null 
oocyte without enucleation. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

Preparation of anti-5hmC and anti-Tet3 antibodies. To prepare the anti-5hmC
antibody, 5-hydroxymethylcytidine 5'-monophosphate (5hmCMP) hapten was
synthesized from cytidine 5'-monophosphate (CMP) in a reaction containing
formaldehyde and tetrahydrofolate catalysed by recombinant MilA hydroxy- 
methylase’. The ribonucleoside hapten was then conjugated to BSA as previously 
described™ for immunization of rabbits (Supplementary Fig. 1). Mass spectral 
analysis confirmed that the 5hmCMP base moiety was unaltered under the
conjugation condition. The antibody was affinity-purified from antiserum with the
5hmCMP-BSA conjugate coupled to agarose beads. To ensure specificity, cross- 
reactivity was removed by incubation with agarose beads crosslinked with CMP- 
BSA conjugate. 

For the detection of Tet3, two rabbit polyclonal antibodies were raised against a
C-terminal region (amino acids 1159-1329, GenBank accession number
NP_898961) and the targeted region (amino acids 887-962) respectively, affinity- 
purified and evaluated as described previously”. 

Immunostaining for 5hmC and Tet3 in fertilized oocytes. Immunofluorescence
detection of 5hmC in zygotes derived from natural matings followed
the procedure for 5mC"'. The signal was detected by Alexa Fluor-conjugated goat 
anti-rabbit or anti-mouse IgG (see Supplementary Table 4 for detailed antibody 
information). Fluorescent images were acquired at 2-µm Z-axis intervals using a
confocal microscope (Leica TCS SP5 II) and their signal intensity was determined 
using the Leica Application Suite-Advanced Fluorescence software. 

The distribution of the Tet3 protein in embryonic cells was determined on 
embryos fixed with 4% paraformaldehyde (PFA) and permeabilized with 0.5% 
Triton. 

Gene targeting. A targeting vector for Tet3 was prepared using the recombineer- 
ing technique” and electroporated into 129Sv ES cells for selection of targeted 
clones. The floxed region contains exons 8-9, which code for the region (76 amino 
acids from EEVLR to NGCTV, GenBank accession number NP_898961) containing
the conserved Fe2+-binding motif of the catalytic domain. Deletion of the
floxed region leads to the loss of 76 amino acids with in-frame fusion between 
exons 7 and 10. Neomycin-resistant embryonic stem clones were screened by PCR 
using a pair of primers crossing the shorter right homologous arm. Positive clones 
were further characterized by Southern blotting to confirm homologous recom- 
bination on the left side of the targeted genomic region (refer to Supplementary 
Fig. 7a). Embryonic stem cells carrying a correctly targeted allele (with neo) were
injected into blastocysts to generate germline chimaeras. Mice with a floxed allele 
were obtained by breeding with C57BL/6J mice. The neo selection marker was 
removed in mice by crossing with ACTFLPe mice”’. To inactivate Tet3 in germ 
cells from the PGC stage onwards, we generated conditional knockout mice by 
crossing floxed mice with TNAP-Cre knock-in mice. TNAP-Cre is expressed in 
primordial germ cells from embryonic day 9.5 to late gestation®®. To inactivate 
Tet3 in female germ cells from the growing oocyte stage, we generated conditional 
knockout mice by crossing floxed mice with Zp3-Cre transgenic mice which 
express Cre exclusively in growing oocytes*’. Mice were genotyped by PCR 
(primer sequences are presented in Supplementary Table 6). The conditional 
knockout mice were on a mixed C57BL/6J-129Sv genetic background. 

Zygote collection. Wild-type BDF1 (from C57BL/6 ♀ × DBA2 ♂) female mice,
4 to 8 weeks old, were injected with pregnant mare’s serum gonadotropin and human
chorionic gonadotropin and mated with BDF1 males. Zygotes were harvested at
different time points after human chorionic gonadotropin injection. The PN stage 
of each individual zygote was classified according to ref. 11, by taking into account 
the pronuclear morphology and the presence of 5mC signal. We stained the DNA 
with DAPI (for fixed zygotes) or Hoechst 33258 (for live zygotes), and used a 
Nomarski differential interference contrast microscope for better observation of 
zygotic pronuclei, when necessary. Tet3-deficient zygotes were obtained from 
[Tet3 f/−, TNAP-Cre] female mice crossed with wild-type males.

Fertility test. [Tet3 f/−, TNAP-Cre] female mice at 6-8 weeks of age were housed
with 8- to 12-week-old wild-type males of proven fertility. Rate of fertilization was 
judged by the presence of two pronuclei in zygotes collected from females after 
pregnant mare’s serum gonadotropin and human chorionic gonadotropin treat- 
ment and mating with wild-type male mice of proven fertility. Number of com- 
pleted pregnancies per plug seen (litters per plug) and number of viable pups born 
per litter (litter size) were calculated from the data included in Supplementary 
Table 2. [Tet3 f/+, TNAP-Cre] female mice were used as a control group for
comparison. Student’s t-test was performed to compare averages in the two dif- 
ferent experimental groups and P < 0.05 was considered to be significant. 
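
The comparison described above can be sketched in a few lines. This is a minimal illustration, not the authors' analysis script: it computes a two-sample Student's t statistic with pooled variance and a two-sided P value using only the Python standard library. The litter sizes are invented placeholder numbers (they are not the data of Supplementary Table 2), and the function names are hypothetical.

```python
import math
from statistics import mean, variance


def t_density(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    norm = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return norm * (1 + x * x / df) ** (-(df + 1) / 2)


def student_t_test(group_a, group_b, steps=2000):
    """Two-sample Student's t-test with pooled variance (equal variances assumed).

    Returns (t, df, p), where p is the two-sided P value obtained by
    Simpson integration of the t density between 0 and |t|.
    """
    na, nb = len(group_a), len(group_b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(group_a) + (nb - 1) * variance(group_b)) / df
    t = (mean(group_a) - mean(group_b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    h = abs(t) / steps  # steps must be even for Simpson's rule
    area = t_density(0, df) + t_density(abs(t), df)
    for i in range(1, steps):
        area += (4 if i % 2 else 2) * t_density(i * h, df)
    area *= h / 3
    return t, df, max(0.0, 1 - 2 * area)


# Hypothetical litter sizes, invented for illustration only.
control_litters = [8, 9, 7, 10, 8, 9]
mutant_litters = [4, 5, 3, 6, 4, 5]
t, df, p = student_t_test(control_litters, mutant_litters)
print(f"t = {t:.2f}, df = {df}, significant at P < 0.05: {p < 0.05}")
```

The pooled-variance form is used because the text names Student's t-test; if the two groups had clearly unequal variances, Welch's variant would be the more cautious choice.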
Isolation of primordial germ cells (PGCs). Female CF1 (Charles River 
Laboratories) mice were mated with male OG2 (ref. 32) mice. This transgenic 
mouse line expresses the enhanced green fluorescent protein (EGFP) from the 
Oct4 promoter and thus enables the selective purification of embryonic and fetal 
germ cells. Embryo parts enriched in PGCs and genital ridges were dissected at 9.5 
and 11.5 days post coitum (dpc), respectively. At 9.5 dpc and 11.5 dpc, the sex of
the embryo was determined by real-time PCR amplification of two genes in a 
single reaction. The Sry amplicon indicated the presence of Y chromosome and 
male sex, whereas amplification of the Snrpn gene served as a positive control for
DNA. Gonads were dissected from male and female fetuses at 13.5, 15.5 and 
17.5 dpc. Male and female gonads were distinguished by their distinct morphology 
at these stages. Gonads were incubated at 37 °C for 15 min in trypsin-EDTA and 
triturated to achieve a single cell suspension containing germ cells and somatic 
cells. Dulbecco’s Modified Eagle’s Medium (DMEM) (Invitrogen) supplemented 
with 20% FBS was added to inactivate trypsin. Cell suspensions were analysed and 
sorted on a MoFlo flow cytometer (Beckman Coulter). Data were acquired using 
488 nm excitation from an Innova-306 Argon laser (Coherent) at 500 mW. EGFP
emission was measured through a 530DF30 filter (Omega Optical). 
Quantitative reverse transcription PCR. Poly(A+) mRNAs were isolated from 
zygotes (n = 200) and female PGCs (embryonic days 9.5, 11.5, 13.5, 15.5 and 17.5) 
by using the Dynabeads mRNA DIRECT Micro Kit (Invitrogen). Oligo (dT)25- 
coupled Dynabeads and mRNA complexes were immediately used for reverse 
transcription using SuperScript III reverse transcriptase (Invitrogen), according 
to the manufacturer’s instructions. Real-time quantitative PCR reactions were 
performed at 50 °C for 2 min and 95 °C for 10 min followed by 50 cycles at
95 °C for 15 s and 60 °C for 1 min using TaqMan Gene Expression Master Mix
(Applied Biosystems) on an iQ5 real-time PCR cycler (Bio-Rad). PCR was per- 
formed with TaqMan MGB primers with 6FAM-based probes (Applied 
Biosystems) using the following assay ID numbers: Tet1 (Mm01169088_m1), 
Tet2 (Mm01312907_m1), Tet3 (Mm00805754_m1) and Stella/Dppa3
(Mm01184198_g1). The cDNA levels of target genes were analysed using a
comparative Ct method and normalized to the internal standard gene Gapdh.
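
For readers unfamiliar with the comparative Ct calculation, the following minimal sketch shows the usual 2^-ΔΔCt arithmetic. It assumes roughly 100% amplification efficiency for both target and reference assays, which the text does not state, and the Ct values and function name are invented for illustration.

```python
def relative_expression(ct_target, ct_reference, ct_target_cal, ct_reference_cal):
    """Fold change of a target gene by the comparative Ct (2^-ddCt) method.

    ct_target / ct_reference: Ct values of the target gene and of the
    reference gene (here Gapdh) in the sample of interest.
    ct_target_cal / ct_reference_cal: the same two values in the calibrator
    sample against which expression is expressed.
    """
    delta_ct_sample = ct_target - ct_reference            # normalize to reference
    delta_ct_calibrator = ct_target_cal - ct_reference_cal
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values: the target crosses threshold 3 cycles earlier
# (relative to Gapdh) in the sample than in the calibrator.
fold = relative_expression(24.0, 18.0, 27.0, 18.0)
print(fold)  # 8.0
```

Each cycle of difference corresponds to a twofold change, so a ΔΔCt of -3 gives an eightfold higher relative level.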
Collection of oocytes and production of parthenogenetic embryos. For collection
of GV oocytes, the ovaries were removed from the female mice 42-44 h after
pregnant mare’s serum gonadotropin injection. Antral follicles were punctured by 
30G needles, and cumulus-enclosed GV oocytes were released into HEPES- 
buffered CZB medium (HCZB) containing 0.2 mM 3-isobutyl-1-methylxanthine 
(IBMX) to inhibit germinal vesicle breakdown. Cumulus cells were removed by 
pipetting. For collection of mature oocytes, oviducts were removed from the 
female mice 13-15 h after human chorionic gonadotropin injection. Cumulus-
oocyte complexes were released into HCZB containing 0.1% bovine testicular
hyaluronidase (300 USP units per mg; ICN Biomedicals Inc.). MII oocytes were
activated for 6 h in activation medium (calcium-free CZB medium containing
10 mM Sr2+ and 5 µg ml−1 cytochalasin B) to generate parthenogenetic embryos,
which were cultured in KSOM medium with amino acids and harvested at 8 h post 
activation. 

Intracytoplasmic sperm injection (ICSI). ICSI was performed according to the 
method of ref. 33 except for being performed at room temperature (about 25 °C). 
Briefly, sperm were collected from adult mice (Oct-delta PE-GFP #18) carrying 
Oct4-EGFP transgene (tg/tg)”° and the head was separated from the tail by apply- 
ing pulses to the head-tail junction by means of a Piezo-driven pipette 
(Piezoelectric actuator; PrimeTech). Only the sperm head was injected into each 
oocyte. Injected oocytes were cultured in KSOM medium for 96 h to examine their 
development in vitro. Images of resulting embryos were acquired with an IX51 
inverted microscope (Olympus) under the same exposure parameters and the 
EGFP intensity of each embryo was quantified with ImageJ software”.

Embryo transfer and Caesarean section. Fertilized eggs derived from natural 
matings were cultured in KSOM medium until the 2-cell stage. Two-cell embryos 
were then transferred into oviducts of surrogate females at day 1 of pseudopreg- 
nancy. For strict comparison, eight mutant and control 2-cell embryos 
(Supplementary Table 4) were transferred into the left and right oviducts of 
recipients, respectively. Recipient mothers were euthanized at 8.5, 11.5 and 14.5 
days of gestation and embryos were dissected. For embryos developed to term, 
caesarean section was performed on day 19 and living pups were nursed by 
lactating ICR females. 

Isolation of male pronuclei. Male pronuclei, which were distinguished from
female pronuclei on the basis of their size and distance from polar bodies, were
harvested from zygotes of PN3-4 stages by breaking the zona using a Piezo drive
(PrimeTech) and aspirating using a micromanipulator. At least 40 male pronuclei
from control or Tet3-deficient zygotes were collected and subjected to bisulphite 
sequencing analysis. 

Bisulphite sequencing. For DNA methylation analysis in oocytes, pronuclei from 
zygotes and pseudo-pronuclei from NT embryos with limited numbers, bisulphite 
conversion was performed in agarose beads as described**. Unbiased amplification 
for methylated and unmethylated sequences was ensured by testing bisulphite 
PCR primers using a 1:1 mixture of unmethylated and in vitro methylated DNA 
fragments. The PCR products were cloned into pMD19-T vectors (Takara Inc.) 
and individual clones were sequenced by BGI Ltd, Shanghai. Bisulphite primer
information is presented in Supplementary Table 6. For the determination of the
methylation state of each sequence, the experiment was performed at least twice
starting from the isolation of cells, pronuclei and embryos.

©2011 Macmillan Publishers Limited. All rights reserved

DNA immunoprecipitation with anti-5mC and anti-5hmC antibodies. To
detect the existence of 5hmC at Line1 repeats in the paternal genome in mouse
zygotes, >100 male pronuclei were harvested from zygotes at PN3-4 stages,
digested with proteinase K and RNase A, and the genomic DNA was purified
by phenol-chloroform extraction. The genomic DNA was mixed with 250 ng of
carrier lambda DNA (dam−, dcm−) and fragmented by AluI digestion, heat-
denatured (10 min, 95 °C), and immunoprecipitated as described previously”
using 1 µg of anti-5hmC or anti-5mC antibodies (Eurogentec, BI-MECY-1000)
G protein-coupled receptors represent the largest family of membrane
receptors’ that instigate signalling through nucleotide
exchange on heterotrimeric G proteins. Nucleotide exchange, or
more precisely, GDP dissociation from the G protein α-subunit, is
the key step towards G protein activation and initiation of downstream
signalling cascades. Despite a wealth of biochemical and
biophysical studies on inactive and active conformations of several
heterotrimeric G proteins, the molecular underpinnings of G
protein activation remain elusive. To characterize this mechanism,
we applied peptide amide hydrogen-deuterium exchange mass
spectrometry to probe changes in the structure of the heterotrimeric
bovine G protein, Gs (the stimulatory G protein for adenylyl
cyclase) on formation of a complex with agonist-bound human
β2 adrenergic receptor (β2AR). Here we report structural links
between the receptor-binding surface and the nucleotide-binding
pocket of Gs that undergo higher levels of hydrogen-deuterium
exchange than would be predicted from the crystal structure of the
β2AR-Gs complex. Together with X-ray crystallographic and electron
microscopic data of the β2AR-Gs complex (from refs 2, 3), we
provide a rationale for a mechanism of nucleotide exchange,
whereby the receptor perturbs the structure of the amino-terminal
region of the α-subunit of Gs and consequently alters the ‘P-loop’
that binds the β-phosphate in GDP. As with the Ras family of
small-molecular-weight G proteins, P-loop stabilization and
β-phosphate coordination are key determinants of GDP (and
GTP) binding affinity.

The formation of a complex between a G protein-coupled receptor
(GPCR) and a heterotrimeric G protein is responsible for the majority
of transmembrane signalling in response to hormones and neurotransmitters.
Heterotrimeric G proteins are composed of a nucleotide-binding
α-subunit (Gα) and an obligate dimer of the β- and γ-subunits
(Gβγ). In their inactive form, Gα-subunits are bound to GDP and
tightly associated with Gβγ. Gα proteins are evolutionarily related to
the Ras family of G proteins but contain a small globular domain,
referred to as the α-helical domain (GαsAH) (Fig. 1a). The Ras-like
GTPase domain (GαsRas) contains most of the catalytic residues
necessary for GTP hydrolysis, as well as the Gβγ and effector binding
regions’. The structures within these regions differ between GTP- and
GDP-bound states, and are appropriately termed switch regions”®.

The process of G protein activation involves formation of a complex
(often referred to as a ternary complex) consisting of an agonist-occupied
GPCR and a nucleotide-free G protein heterotrimer. In a
companion manuscript (ref. 2), we present the β2AR-Gs crystal structure
(Fig. 1b), which provides the first high-resolution snapshot of such
a ternary complex. However, a static view of this structure provides only
limited insight into the mechanism of GDP release, the first step in G
protein activation. In an effort to provide dynamic insights into the
mechanism of receptor-mediated activation of Gs, we studied the
effect of agonist-bound β2AR on the structure of Gs using peptide
amide hydrogen-deuterium exchange mass spectrometry (DXMS,
see Online Methods). The rates of exchange of the amide hydrogens
in a protein are a function of the protein’s thermodynamic stability,
most particularly the stability of the hydrogen bonds that each amide
forms in the protein’s native structure. In general, an amide hydrogen
involved in an intra-molecular hydrogen bond, such as those that
occur in a β-strand or α-helix in a relatively stable region of the protein,
exchanges more slowly than amide hydrogens forming less stable
bonds’**. Changes in exchange rates reflect the propensity for formation
or disruption of these bonds, thereby providing information about
protein structural changes and dynamics.

Figure 1 | Receptor-mediated activation of G proteins through promoting
GDP release. Agonist (BI-167107)-bound receptor-mediated interaction with
heterotrimeric G proteins promotes nucleotide (GDP) release in the G protein
α-subunit. a, Structures of β2AR (green) and G protein heterotrimer (Gαs).
The nucleotide-binding Gα-subunit (grey) is composed of a Ras-homology
domain (GαsRas) and an α-helical domain (GαsAH). b, The crystal structure of
BI-167107 (agonist)-bound β2AR with Gs reveals large domain movements in
the α-subunit (yellow) of Gs that are associated with nucleotide release.

1Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, California 94305, USA. 2Department of Neuroscience and Pharmacology, The Panum Institute,
University of Copenhagen, 2200 Copenhagen N, Denmark. 3Department of Medicine, Biomedical Sciences Graduate Program and UCSD DXMS Proteomics Resource, University of California San Diego, La
Jolla, California 92023, USA. 4Department of Pharmacology, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA. 5Department of Chemistry, University of Wisconsin, Madison,
Wisconsin 53706, USA. †Present address: Department of Bionano Engineering, Hanyang University, Ansan 426-791, Korea.

*These authors contributed equally to this work.

29 SEPTEMBER 2011 | VOL 477 | NATURE | 611
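
The qualitative link between hydrogen-bond stability and exchange rate can be made explicit with the standard two-state (Linderstrøm-Lang) description of amide exchange. The text does not write this model out, so the block below is an illustrative sketch under the usual assumptions rather than the authors' own formalism. The symbols are introduced here: k_op and k_cl are the opening and closing rates of the protecting hydrogen bond, k_ch is the intrinsic chemical exchange rate of an unprotected amide, and the closed state is taken to be strongly favoured (k_op much smaller than k_cl).

```latex
\mathrm{NH}_{\text{closed}}
  \underset{k_{\mathrm{cl}}}{\overset{k_{\mathrm{op}}}{\rightleftharpoons}}
  \mathrm{NH}_{\text{open}}
  \xrightarrow{\;k_{\mathrm{ch}}\;} \mathrm{ND},
\qquad
k_{\mathrm{ex}} \approx \frac{k_{\mathrm{op}}\,k_{\mathrm{ch}}}{k_{\mathrm{cl}} + k_{\mathrm{ch}}}

% EX2 limit (k_cl >> k_ch): exchange reports on the opening equilibrium
k_{\mathrm{ex}} \approx K_{\mathrm{op}}\,k_{\mathrm{ch}},
\qquad
K_{\mathrm{op}} = \frac{k_{\mathrm{op}}}{k_{\mathrm{cl}}},
\qquad
\Delta G_{\mathrm{op}} = -RT \ln K_{\mathrm{op}}
                       = RT \ln\!\frac{k_{\mathrm{ch}}}{k_{\mathrm{ex}}}
```

In this picture, a region whose exchange becomes faster on complex formation is one in which the protecting hydrogen bonds open more readily, that is, one whose local stability has decreased.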

We examine the amide hydrogen exchange behaviour of (1) the Gs
heterotrimer alone, (2) Gs in complex with β2AR, (3) Gs following
uncoupling (dissociation) of the β2AR-Gs complex with the transition
state analogue GDP/aluminium fluoride (AlF3 or [AlF4]−), and (4) the
β2AR-Gs complex exposed to GDP (Fig. 2, Supplementary
Figs 2-9a). When comparing the Gs heterotrimer with the β2AR-Gs
complex, we observe marked changes in exchange rates for Gαs (Gs
protein α-subunit), but little change in the exchange rates of Gβ or Gγ
(Supplementary Figs 2-5). In addition, we do not report exchange
behaviour on the β2AR itself because comparatively limited peptide
probe sequence coverage was obtained for the receptor when we used
DXMS analysis conditions optimal for the G proteins’’.
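
Fig. 2 reports these comparisons as changes in the percentage of the theoretical maximum number of deuterons incorporated per peptide. A minimal sketch of that bookkeeping follows. It assumes a common DXMS convention that the text does not spell out: the maximum number of exchangeable amides in a peptide is counted over all residues after the first two, excluding prolines, because proline has no amide hydrogen and deuterons on the N-terminal residues are typically lost during analysis. The peptide sequence and deuteron counts are invented for illustration, and the function names are hypothetical.

```python
def max_exchangeable_amides(peptide, n_term_skipped=2):
    """Count backbone amides that can retain deuterium (assumed convention).

    Skips the first n_term_skipped residues and every proline.
    """
    return sum(1 for residue in peptide[n_term_skipped:] if residue != "P")


def percent_deuteration(deuterons, peptide):
    """Deuterons incorporated, as a percentage of the theoretical maximum."""
    maximum = max_exchangeable_amides(peptide)
    if maximum == 0:
        raise ValueError("peptide has no exchangeable amides under this convention")
    return 100.0 * deuterons / maximum


def exchange_change(deuterons_before, deuterons_after, peptide):
    """Change in percent deuteration between two conditions (after minus before)."""
    return (percent_deuteration(deuterons_after, peptide)
            - percent_deuteration(deuterons_before, peptide))


# Invented example: one peptide measured at the same time point in two states.
peptide = "ARLLLLGPGES"
print(max_exchangeable_amides(peptide))       # 8
print(exchange_change(2.0, 6.0, peptide))     # 50.0
```

A positive value would be drawn towards the "increase" end of the colour scale and a negative value towards the "decrease" end.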
Transitioning from the GDP-bound heterotrimer to the nucleotide-free
β2AR-Gs complex is associated with changes in exchange rates
throughout the Gα subunit (Fig. 2a). There is a general increase in
exchange in the segments that form the nucleotide binding pocket,
consistent with a loss of polar interactions with GDP or with
alterations in protein structure following GDP dissociation. We
observe reduced exchange at the carboxy-terminal portion of the
α5-helix when bound to β2AR (Figs 2a and 3a, b). This is not surprising,
considering the crystal structures of opsin and metarhodopsin
bound to a C-terminal peptide of transducin'*”°, and of the β2AR
bound to Gs (ref. 2); in these three structures, the C terminus of Gα
subunits penetrates into the cytoplasmic core of the transmembrane
bundle of GPCRs. This finding is also consistent with a plethora of
biochemical and biophysical evidence (reviewed in ref. 17), as well as
mutagenesis studies and peptide analysis that suggest that the C terminus
determines receptor coupling specificity”*.

In contrast to the C-terminal region of α5, we observe a dramatic
increase in exchange rates of peptide amides in the N-terminal segment
of the α5-helix, suggesting an increase in the dynamics or disordering
of this segment that is not evident in the structure of the β2AR-Gs
complex (Figs 2a, 3a, b, and Supplementary Fig. 11a). The β6-α5 loop
constitutes part of the nucleotide-binding pocket. Thus, increases in
exchange at the N-terminal portion of the α5-helix probably reflect the
sensitivity to nucleotide release (Figs 2a, 3b). The reintroduction of
GDP reverses the effects observed on formation of the complex in this
region (Fig. 2c). These observations are consistent with EPR spectroscopy
studies suggesting movement of the α5-helix of the G protein
transducin on interacting with photo-activated rhodopsin”.

As expected, uncoupling Gs from β2AR with GDP/AlF3 reverses all
exchange changes induced by receptor binding (Fig. 2b). GDP/AlF3 is
a transition state analogue that structurally and functionally mimics
GTP when bound to G proteins. Interestingly, the overall effect of
adding GDP on exchange rates was similar to the effect of adding
GDP/AlF3; however, we did not observe a comparable increase in
exchange at the C-terminal end of the α5-helix (Fig. 2c). These results
suggest that the C-terminal peptide of GDP-bound Gαs may remain
weakly coupled to the receptor, and are consistent with crosslinking
experiments performed under similar conditions showing persistent
interactions between β2AR and Gs in the presence of GDP (Supplementary
Fig. 10). It is tempting to speculate that this GDP-bound
β2AR-Gs complex may represent a ‘pre-coupled’ receptor complex.
Nevertheless, it is likely that the GDP-bound β2AR-Gαs complex is
not very stable, because increased exchange at the α5-helix C terminus
could be observed following longer exchange durations or incubations
at higher (but still physiological) temperatures (see Supplementary
Figs 2 and 10).

Figure 2 | DXMS reveals conformational changes in Gαs when in complex
with agonist-bound β2AR in solution. Pairwise comparisons of
DXMS of Gαs under different conditions.
a, Changes in DXMS of Gαs following formation of
the β2AR-Gs complex relative to Gαs in the Gs
heterotrimer. b, Changes in DXMS of Gαs in the
β2AR-Gs complex following dissociation of the
complex with GDP/AlF3. c, Changes in DXMS of
Gαs in the β2AR-Gs complex following the
addition of GDP alone. The changes in amide
hydrogen-deuterium exchange (HX), given on the
colour scale (bottom) as changes in the percentage
of the theoretical maximum number of deuterons
incorporated per peptide, were mapped on to the
crystal structure of Gαs based on the GTPγS-
bound form (PDB 1AZT)”". Residues displaying
increases (magenta) and decreases (blue) in
deuterium incorporation when comparing
different conditions were plotted according to the
indicated heat map. Regions where no mass
spectrometry data were obtained are indicated in
grey. Among three time points analysed (see
Supplementary Fig. 2), 100 s time points are
presented.

Figure 3 | Agonist-bound β2AR-mediated activation of GDP release
revealed by DXMS. a, Conformational changes in the GαsRas of Gs when in
complex with agonist-bound β2AR (mauve). Highlighted are the changes in
deuterium exchange (HX), according to the indicated heat map, of key regions
of GαsRas mapped onto the crystal structure of the complex (ref. 2). Colour
scale applies to all panels, a-d. b, c, Comparison of the structure of Gαs in
complex with β2AR with that of Gαi (the inhibitory G protein for adenylyl
cyclase bound to GDP), highlighting regions of increases or decreases in HX in
the β6-strand-α5 helix (b) or in the β1-strand, P-loop and α1-helix (c). As
above, Gαs (in b and c) is coloured according to the indicated heat map
representing changes in HX when comparing Gs heterotrimer with the
nucleotide-free β2AR-Gs complex. Residues where no mass information was
obtained are coloured charcoal grey. The superimposed structure of Gαi bound
to GDP (transparent grey) is based on the heterotrimeric GDP-bound form”,
but illustrating the conserved glutamate (E50 in Gαs) in the GαsRas and
arginine (R201 in Gαs) located in the GαsAH. d, Five amide hydrogens,
contributed by the highly conserved ‘RLLL’ motif, are involved in stabilizing the
β1-strand in a peptide fragment of Gαs that displays high HX.

The most striking effect of β2AR-Gs complex formation on amide
hydrogen exchange was observed in the β1-strand, which links the
second intracellular loop (ICL2) of the agonist-bound receptor to the
P-loop that coordinates the β-phosphate of GDP in Gαs (Figs 2a, 3a, c).
The large increase in exchange rate in the β1-strand is unexpected on the
basis of relatively small structural changes in the β1-α1 loop observed when
comparing the structures of Gαi bound to GDP with the β2AR-Gs
complex (Fig. 3c). The relatively minor differences observed in the crystal
structures may not reflect the change required for GDP release, because
the crystallization process often favours the lowest energy and most
stable conformation. In contrast, DXMS experimentation interrogates
the entire ensemble of native-state conformations under physiological
conditions (pH, salt, protein concentration and temperature)”. Such
analysis reveals that the β1-P-loop-α1 region undergoes structural
changes that markedly alter the stability of amide backbone hydrogen
bonds.

The crystal structure of Gαs-GTPγS suggests that five amide hydrogens
stabilize the backbone of the β1-strand of this peptidic fragment
(Fig. 3d and Supplementary Fig. 11b). Four of the five amides are
contributed by the highly conserved ‘RLLL’ motif, found in all G
protein α-subunits (see alignment in Supplementary Fig. 12). DXMS
analysis indicates that 4-5 hydrogens rapidly exchange in this fragment.
We were unable to detect fragments of the neighbouring β3 and
β4 strands by mass spectrometry, but speculate that alterations in
exchange might also be observed in these regions. The dramatic
increase in exchange in this highly conserved motif suggests that
formation of the β2AR-Gs complex is associated with large changes
in the structure and/or stability of the β-sheet formed by the β1-5
strands.

The increased exchange in the β1-strand, the P-loop and the proximal
α1-helix is compatible with the well-established mechanism of
nucleotide exchange in the Ras family of G proteins. Here the
β-phosphate of GDP is wedged between the highly conserved ‘P-loop’
and α1-helix, and a single Mg2+ ion that is coordinated by residues
within the P-loop. GDP release in Ras-like G proteins is manifested
simply by altering Mg2+ coordination to the β-phosphate®. Although
heterotrimeric Gα proteins lack the dependence on Mg2+ for GDP
binding, like all G proteins they display a 10°-10°-fold preference for
GDP over GMP*. The structural similarities between the Ras and
heterotrimeric family of G proteins suggest that the mechanism for
nucleotide exchange should be similar. Thus, a critical action of the
agonist-bound receptor appears to be the engagement of the N
terminus of Gα, thereby altering the position and/or stability of the
β1-strand with associated changes in the structure of the P-loop and
α1-helix.

Figure 4 | Deuterium exchange at the interface between the α-helical and
Ras homology domains. a, Changes in HX at the interface between the
GαsAH (ribbon diagram) and GαsRas (surface rendering) are mapped onto the
‘open’ conformation of Gαs observed in the β2AR-Gs complex (inset). In this
‘open’ conformation, GαsAH and GαsRas are coloured according to the
indicated heat map. Also shown is a ribbon diagram of GαsAH in a ‘closed’
position (grey) similar to that observed in the crystal structure of the GDP-
bound Gαi heterotrimer. The location of the GDP binding site is shown as
spheres. b, Surface rendering of a, rotated back 90° to show the cytoplasmic side
of the ‘open’ conformation of nucleotide-free Gαs.

DXMS analysis also reveals regions with higher exchange that
extend beyond the nucleotide-binding site and encompass the entire
interface between the GαsAH and GαsRas (Figs 2, 4). These data are
consistent with crystallographic and electron microscopy evidence on
the β2AR-Gs complex (refs 2, 3). These studies and recent double
electron-electron resonance analyses” suggest that the GαsRas and
GαsAH undergo a ‘clam-shell like’ opening in the absence of nucleotide
(see Fig. 4). Nucleotide release appears to result in re-positioning of
the GαsAH, but the observed slow rate of exchange within the core of
the domain suggests that the domain itself is intact and folded. In the
GDP- and GTPγS-bound forms of G proteins, a highly conserved
glutamate residue in the P-loop (E50 in Gαs) tightly binds the highly
conserved catalytic arginine residue (R201 in Gαs), which is located in
the GαsAH (Fig. 3c and see alignment in Supplementary Fig. 12).
However, the side chain of E50 in Gαs in the receptor complex is poorly
ordered in the crystal structure, displaying high B (temperature)
factors. We propose that alterations in the P-loop structure resulting
from the loss of β-phosphate coordination disrupt the E50-R201
interaction and facilitate the dissociation of GαsAH from GαsRas.

In summary, DXMS analysis of the β2AR-Gs complex complements
the crystal structure and provides essential additional mechanistic
insights into the structural events linking GPCR-G protein
complex formation and GDP release. The N and C termini of Gαs
represent the principal structural conduits between the receptor and
the nucleotide binding pocket (Fig. 5). Although it is not at present
possible to determine the exact sequence of events leading to nucleotide
dissociation and the formation of a stable complex, we speculate
that the initial interaction may involve the C terminus of GDP-bound
Gαs and agonist-bound β2AR. Subsequent interactions between ICL2
of the β2AR and the αN-helix of Gαs are associated with GDP release
through dynamic changes in the β1-strand that involve disruption of a
conserved hydrogen bond network and are coupled to structural
changes in the highly conserved ‘P-loop’ that wraps around the
β-phosphate of GDP. Thus, in a coordinated event, the agonist-bound
receptor engages both the N terminus and C terminus to promote
nucleotide release.

Figure 5 | Mechanism for receptor-catalysed nucleotide exchange in
heterotrimeric G protein α-subunits. Shown is the step-wise dissociation of
GDP from Gαs (orange) by agonist-activated β2AR that involves the
engagement of both the N and the C terminus of Gαs. The activation of Gs
through an activated β2AR (green) results in GDP release and subsequent GTP
binding. The activated receptor engages the C terminus of the α5-helix of Gαs
which undergoes a rigid-body translation upward into the receptor core and
reorganizes the β6-α5 loop, a region that participates in purine ring binding. In
a simultaneous or sequential event, ICL2 of the β2AR engages the N terminus of
the Gαs, leading to reorganization of its β1-strand/P-loop, the loss of
coordination of the β-phosphate of GDP (blue), and subsequently GDP release.
The position of the N-terminal helix is aided by the Gβγ-subunits (not shown).
The concomitant disruption of the interaction between the P-loop and GαsAH,
primarily through the highly conserved R201 in the GαsAH and E50 in the
P-loop, opens GαsAH allowing GDP to freely dissociate. Formation of the
nucleotide-free form allows GTP (grey) to bind, resulting in reformation of the
‘closed’ conformation, and activation of the G protein through functionally
dissociating from Gβγ and uncoupling from β2AR.


METHODS SUMMARY 


DXMS studies were performed on purified Gs and β2AR-Gs complex in the
absence or presence of guanine nucleotides. The β2AR-Gs complex was formed
from β2AR and Gs protein expressed in insect (Spodoptera frugiperda and
Trichoplusia ni, respectively) cells. Samples were incubated on ice for 100, 1,000
and 10,000 s in the presence of D2O, then quenched and frozen for subsequent
analysis by fragmentation and mass spectrometry. For more experimental details
see Online Methods.


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Expression and purification of β2AR. A β2AR construct truncated at position
365 (β2AR-365) was expressed in Sf9 insect (Spodoptera frugiperda) cell cultures
(grown in ESF 921 medium, Expression Systems) infected with recombinant
baculovirus (BestBac, Expression Systems), and solubilized in n-dodecyl-β-D-
maltoside (DDM) according to methods described previously*’. M1 Flag affinity
chromatography (Sigma) served as the initial purification step followed by
alprenolol-Sepharose chromatography for selection of functional receptor. A
subsequent M1 Flag affinity chromatography step was used to exchange
receptor-bound alprenolol for high-affinity agonist BI-167107. The agonist-bound receptor
was eluted, dialysed against buffer (20 mM HEPES, pH 7.5, 100 mM NaCl, 0.1%
DDM and 10 µM BI-167107), treated with PNGaseF and lambda phosphatase
(New England Biolabs), and concentrated to approximately 50 mg ml−1 with a
50 kDa molecular weight cut-off Millipore concentrator.

Expression and purification of Gs heterotrimer. Bovine Gαs short, His6-rat
Gβ, and bovine Gγ were co-expressed in HighFive insect (Trichoplusia ni) cells
(Invitrogen) grown in Insect Xpress serum-free media (Lonza). Cultures were
infected with three separate Autographa californica nuclear polyhedrosis viruses,
each containing the gene for one of the G protein subunits at a 1:1 multiplicity of
infection. Infected cells were harvested and resuspended in 75 ml lysis buffer
(50 mM HEPES pH 8.0, 65 mM NaCl, 1.1 mM MgCl2, 1 mM EDTA, 1× PTT
(35 µg ml−1 phenylmethanesulphonyl fluoride, 32 µg ml−1 tosyl-phenylalanyl-
chloromethyl ketone, 32 µg ml−1 tosyl-lysyl-chloromethyl ketone), 1× LS
(3.2 µg ml−1 leupeptin and 3.2 µg ml−1 soybean trypsin inhibitor), 5 mM β-ME, and
10 µM GDP) per litre of culture volume. The suspension was pressurized with
600 p.s.i. N2 for 40 min in a nitrogen cavitation bomb (Parr Instrument
Company). After depressurization, the lysate was centrifuged to remove nuclei
and unlysed cells, and then ultracentrifuged at 180,000g for 40 min. The pelleted
membranes were resuspended in 30 ml wash buffer (50 mM HEPES, pH 8.0,
50 mM NaCl, 100 µM MgCl2, 1× PTT, 1× LS, 5 mM β-ME, 10 µM GDP) per
litre culture volume using a Dounce homogenizer and centrifuged again at
180,000g for 40 min. The washed pellet was resuspended in a minimal volume
of wash buffer and flash frozen with liquid nitrogen.

The frozen membranes were thawed and diluted to a total protein concentration 
of 5 mg ml⁻¹ with fresh wash buffer. Sodium cholate detergent was added to the 
suspension at a final concentration of 1.0%, MgCl₂ was added to a final concentration 
of 5 mM, and 0.05 mg of purified protein phosphatase 5 (PP5, prepared 
from E. coli culture) was added per litre of culture volume. PP5 cDNA was 
generously provided by M. Chinkers (University of Southern Alabama). The 
sample was stirred on ice for 40 min, and then centrifuged at 180,000g for 
40 min to remove insoluble debris. The supernatant was diluted fivefold with 
Ni-NTA load buffer (20 mM HEPES, pH 8.0, 363 mM NaCl, 1.25 mM MgCl₂, 
6.25 mM imidazole, 0.2% Anzergent 3-12, 1× PTT, 1× LS, 5 mM β-ME, 10 µM 
GDP), taking care to add the buffer slowly to avoid reducing the cholate concentration 
below its critical micelle concentration too quickly. 3 ml of Ni-NTA resin 
(Qiagen) pre-equilibrated in Ni-NTA wash buffer 1 (20 mM HEPES, pH 8.0, 
300 mM NaCl, 2 mM MgCl₂, 5 mM imidazole, 0.2% sodium cholate, 0.15% 
Anzergent 3-12, 1× PTT, 1× LS, 5 mM β-ME, 10 µM GDP) per litre culture 
volume was added and the sample was stirred on ice for 20 min. The resin was 
collected into a gravity column and washed with 4× column volumes of Ni-NTA 
wash buffer 1, Ni-NTA wash buffer 2 (20 mM HEPES, pH 8.0, 50 mM NaCl, 1 mM 
MgCl₂, 10 mM imidazole, 0.15% Anzergent 3-12, 0.1% DDM, 1× PTT, 1× LS, 
5 mM β-ME, 10 µM GDP), and Ni-NTA wash buffer 3 (20 mM HEPES, pH 8.0, 
50 mM NaCl, 1 mM MgCl₂, 5 mM imidazole, 0.1% DDM, 1× PTT, 1× LS, 5 mM 
β-ME, 10 µM GDP). The protein was eluted with Ni-NTA elution buffer (20 mM 
HEPES, pH 8.0, 40 mM NaCl, 1 mM MgCl₂, 200 mM imidazole, 0.1% DDM, 
1× PTT, 1× LS, 5 mM β-ME, 10 µM GDP). Protein-containing fractions were 
pooled and MnCl₂ was added to a final concentration of 100 µM. Purified lambda 
protein phosphatase (prepared from E. coli culture) was added to the eluate (50 µg 
per litre of culture volume, final) and incubated on ice with stirring for 30 min. The 
eluate was passed through a 0.22 µm filter and loaded directly onto a MonoQ HR 
16/10 column (GE Healthcare) equilibrated in MonoQ buffer A (20 mM HEPES, 
pH 8.0, 50 mM NaCl, 100 µM MgCl₂, 0.1% DDM, 5 mM β-ME, 1× PTT). The 
column was washed with 150 ml buffer A at 5 ml min⁻¹ and bound proteins were 
eluted over 350 ml with a linear gradient up to 28% MonoQ buffer B (same as 
buffer A except with 1 M NaCl). Fractions were collected in tubes spotted with 
enough GDP to make a final concentration of 10 µM. The Gs-containing fractions 
were concentrated to 2 ml using a stirred ultrafiltration cell with a 10 kDa NMWL 
regenerated cellulose membrane (Millipore). The concentrated sample was run on 
a Superdex 200 prep grade XK 16/70 column (GE Healthcare) equilibrated in S200 
buffer (20 mM HEPES, pH 8.0, 100 mM NaCl, 1.1 mM MgCl₂, 1 mM EDTA, 
0.012% DDM, 100 µM TCEP, 2 µM GDP). The fractions containing pure Gs were 
pooled, glycerol was added to 10% final concentration, and then the protein was 
concentrated to at least 10 mg ml⁻¹ using a 30 kDa MWCO regenerated cellulose 
Amicon centrifugal ultrafiltration device. The concentrated sample was then 
aliquoted, flash frozen, and stored at −80 °C. A typical yield of final, purified Gs 
heterotrimer from 8 l of cell culture volume was 6–8 mg. 
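
As a rough check on the MonoQ step, the NaCl concentration reached at the end of the gradient can be estimated from the stated buffer compositions. The sketch below assumes ideal linear mixing of buffer A (50 mM NaCl) and buffer B (1 M NaCl) and assumes the gradient starts at 0% B; the function names are illustrative and are not part of the published protocol.

```python
def nacl_at_percent_b(percent_b, nacl_a_mm=50.0, nacl_b_mm=1000.0):
    """NaCl concentration (mM) when buffer B makes up `percent_b` of the mix.

    Assumes ideal mixing of buffer A (50 mM NaCl) and buffer B (1 M NaCl).
    """
    if not 0.0 <= percent_b <= 100.0:
        raise ValueError("percent_b must lie between 0 and 100")
    frac_b = percent_b / 100.0
    return (1.0 - frac_b) * nacl_a_mm + frac_b * nacl_b_mm


def gradient_slope_mm_per_ml(start_pct, end_pct, volume_ml):
    """Average change in NaCl concentration (mM) per ml of gradient volume."""
    if volume_ml <= 0:
        raise ValueError("volume_ml must be positive")
    delta = nacl_at_percent_b(end_pct) - nacl_at_percent_b(start_pct)
    return delta / volume_ml


if __name__ == "__main__":
    # Assumed start at 0% B; the text states only the 28% end point and 350 ml.
    print(f"NaCl at 28% B: {nacl_at_percent_b(28):.0f} mM")
    print(f"Slope: {gradient_slope_mm_per_ml(0, 28, 350):.2f} mM per ml")
```

Under these assumptions the gradient ends near 316 mM NaCl, a slope of about 0.76 mM per ml over the 350 ml elution.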

Complex formation and purification. For reasons of efficiency and higher 
protein quality, the receptor–G protein (β₂AR-365–Gs heterotrimer) complex 
that was used for DXMS and EM (ref. 3) experimentation was derived from 
preparations also intended for protein crystallography”. Heterotrimeric Gs protein 
was mixed at approximately 100 µM concentration with BI-167107-bound β₂AR-365 
in molar excess (approximately 130 µM) in 2 ml buffer (10 mM HEPES pH 
7.5, 100 mM NaCl, 0.1% DDM, 1 mM EDTA, 3 mM MgCl₂, 10 µM BI-167107) 
and incubated for 3 h at room temperature (25 °C). To obtain a nucleotide-free 
β₂AR–Gs complex, apyrase (25 mU ml⁻¹, NEB) was added after 90 min to hydrolyse 
residual GDP released from Gαs on binding to the receptor. The complex was 
exchanged into MNG-3 by adding the β₂AR–Gs complex mixture (2 ml) to 8 ml 
buffer (20 mM HEPES pH 7.5, 100 mM NaCl, 10 µM BI-167107) containing 1% 
MNG-3 for 1 h at room temperature. The β₂AR–Gs complex was loaded onto an 
M1 Flag resin and washed in buffer (20 mM HEPES, pH 7.5, 100 mM NaCl, 10 µM 
BI-167107 and 3 mM CaCl₂) containing 0.2% MNG-3 and eluted in buffer 
(20 mM HEPES, pH 7.5, 100 mM NaCl, 10 µM BI-167107, 5 mM EDTA and 
200 µg ml⁻¹ Flag peptide) containing 0.02% MNG-3. The β₂AR–Gs complex 
was purified by a final size exclusion chromatography step using a 
Superdex 200 10/300 GL column (GE Healthcare) equilibrated with buffer containing 
0.02% MNG-3, 10 mM HEPES pH 7.5, 100 mM NaCl, 10 µM BI-167107 
and 100 µM TCEP. Peak fractions were pooled and concentrated to approximately 
90 mg ml⁻¹ with a 100 kDa MWCO Viva-spin concentrator. A representative 
elution profile of the complex on a Superdex 200 column can be found in 
Supplementary Fig. 1. 
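
The detergent exchange step combines 2 ml of complex in 0.1% DDM with 8 ml of buffer containing 1% MNG-3, so the resulting detergent concentrations follow from simple volume-weighted mixing. The helper below is a minimal sketch of that arithmetic; it assumes additive volumes and ignores the small volumes of apyrase and other additions, and the `mix` function is a hypothetical name introduced only for illustration.

```python
def mix(components):
    """Return (total_volume_ml, concentrations) for a list of solutions.

    `components` is a list of (volume_ml, {solute: concentration}) pairs.
    Concentrations keep whatever unit they were given in (here % detergent).
    Volumes are assumed to be additive.
    """
    total = sum(volume for volume, _ in components)
    if total <= 0:
        raise ValueError("total volume must be positive")
    amounts = {}
    for volume, solutes in components:
        for name, conc in solutes.items():
            amounts[name] = amounts.get(name, 0.0) + conc * volume
    return total, {name: amount / total for name, amount in amounts.items()}


if __name__ == "__main__":
    total_ml, final = mix([
        (2.0, {"DDM_percent": 0.1}),    # complex mixture in 0.1% DDM
        (8.0, {"MNG3_percent": 1.0}),   # exchange buffer with 1% MNG-3
    ])
    print(total_ml, final)
```

With these inputs the mixture is 10 ml at roughly 0.02% DDM and 0.8% MNG-3, a fivefold dilution of the original DDM.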

Nucleotide treatment. β₂AR–Gs complex prepared as described above was diluted 
to 10 mg ml⁻¹ in buffer containing 0.02% MNG-3, 20 mM HEPES, pH 7.5, 100 mM 
NaCl, 10 µM BI-167107 and 100 µM TCEP. The β₂AR–Gs complex was incubated 
with either 1 mM GDP or 1 mM GDP/AlF₄⁻ for 2 h at room temperature. 

Deuterium exchange mass spectrometry. 1.5 µl of β₂AR–Gs complex (10 mg 
ml⁻¹) or 1.5 µl of Gs heterotrimer (7 mg ml⁻¹) was mixed with 4.5 µl of D₂O 
buffer (20 mM HEPES, pH 7.5, 100 mM NaCl, 10 µM BI-167107, 100 µM 
TCEP, 0.0015% MNG-3 in D₂O) and incubated for 100, 1,000 and 10,000 s on 
ice. At the indicated times, the sample was quenched by 15 µl of ice-cold quench 
solution (0.1 M NaH₂PO₄, 20 mM TCEP, 16.6% glycerol, pH 2.4), immediately 
frozen on dry ice, and stored at −80 °C. Non-deuterated control was prepared in 
H₂O buffer (20 mM HEPES, pH 7.5, 100 mM NaCl, 10 µM BI-167107, 100 mM 
TCEP, 0.0015% MNG-3 in H₂O), mixed with quench solution, and snap-frozen 
on dry ice. Samples were thawed and immediately passed through an immobilized 
porcine pepsin column (16 µl bed volume) at a flow rate of 20 µl min⁻¹ of 0.05% 
trifluoroacetic acid. Peptide fragments were collected contemporaneously on a 
C18 trap column for desalting and separated by a Magic C18AQ column 
(Michrom BioResources Inc.) using a linear gradient of acetonitrile from 6.4% 
to 38.4% over 30 min. Mass spectrometric analysis was performed using an LCQ 
Classic mass spectrometer (Thermo Finnigan), with a capillary temperature of 
200 °C. Deuterium quantification data were collected in MS1 profile mode, and 
peptide identification data were collected in data-dependent MS/MS mode. 
Recovered peptide identification and analysis were carried out using DXMS 
Explorer (Sierra Analytics Inc.), specialized software for processing DXMS 
data²⁴⁻²⁶. 
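
The volumes quoted for the exchange reaction fix the D₂O content during labelling and after quenching. The following sketch works through that arithmetic; it assumes the protein sample contributes only H₂O, that the D₂O buffer is fully deuterated, and that volumes are additive. None of these assumptions is stated in the protocol, and the function name is illustrative.

```python
def d2o_fraction(protein_ul, d2o_buffer_ul, quench_ul=0.0):
    """Volume fraction of D2O after mixing protein, D2O buffer and quench.

    Assumes the protein sample and quench solution are purely H2O-based and
    the D2O buffer is 100% D2O, with additive volumes.
    """
    total = protein_ul + d2o_buffer_ul + quench_ul
    if total <= 0:
        raise ValueError("total volume must be positive")
    return d2o_buffer_ul / total


if __name__ == "__main__":
    labelling = d2o_fraction(1.5, 4.5)
    quenched = d2o_fraction(1.5, 4.5, quench_ul=15.0)
    print(f"D2O during labelling: {labelling:.0%}")
    print(f"D2O after quench:     {quenched:.1%}")
    # Time points from the protocol are spaced by a factor of ten.
    print([100 * 10 ** i for i in range(3)])
```

Under these assumptions the labelling reaction contains 75% D₂O, which drops to about 21% once the 15 µl of quench solution is added.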

Before the performance of studies with deuterated proteins, extensive preliminary 
‘fragmentation tuning’ studies were performed with non-deuterated protein 
mixtures. Proteins were acidified, supplemented with various concentrations of 
denaturants (guanidine-HCl, 0.0-4.0 M), supplemental detergents (DDM, 0.0 and 
0.1%) and reductants (TCEP, 0.015-0.5 M); incubated in such solutions for vary- 
ing times at 0 °C; and then subjected to on-line solid-state pepsin proteolysis (45 s 
contact time at either 0 °C or room temperature) and then analysed by LC-MS at 
0°C as described above. The conditions that produced the most comprehensive 
fragmentation map for the G protein component of the mixture (described and 
employed above) were found to be markedly different from those that gave the 
best fragmentation map for the receptor protein component. Addition of supple- 
mental DDM (0.1%), long incubations in concentrations of denaturants (4.0 M), 
or digestion at room temperature were required to obtain even a partial fragmenta- 
tion map of the receptor. Compared to the G protein, the receptor is more highly 
disulphide cross-linked, and hydrophobic in character, probably accounting for 
the requirement for a more aggressive denaturation and protease digestion, 
and supplementation with additional detergent to keep hydrophobic peptides in 
solution. These several conditions degraded the comprehensiveness and resolution 
of the G protein fragmentation map, and considerably increased the deuterium 
losses of the G protein peptides after exchange quench when compared to the G 
protein-optimal conditions. Given this, our deuteration analysis was subsequently 
directed exclusively to the G protein component of the mixture employing analysis 
conditions optimal for them. 

23. Kobilka, B. K. Amino and carboxyl terminal modifications to facilitate the 
production and purification of a G protein-coupled receptor. Anal. Biochem. 231, 
269–271 (1995). 
24. Mendillo, M. L. et al. A conserved MutS homolog connector domain interface 
interacts with MutL homologs. Proc. Natl Acad. Sci. USA 106, 22223–22228 (2009). 
25. Chen, H. et al. Allosteric inhibition of complement function by a staphylococcal 
immune evasion protein. Proc. Natl Acad. Sci. USA 107, 17621–17626 (2010). 
26. Li, S. et al. Mechanism of intracellular cAMP sensor Epac2 activation: cAMP-induced 
conformational changes identified by amide hydrogen/deuterium 
exchange mass spectrometry (DXMS). J. Biol. Chem. 286, 17889–17897 (2011). 
The structure and catalytic mechanism of a 
poly(ADP-ribose) glycohydrolase 


Dea Slade¹,²*, Mark S. Dunstan³*, Eva Barkauskaite¹, Ria Weston¹, Pierre Lafite⁴, Neil Dixon³, Marijan Ahel⁵, David Leys³ 
& Ivan Ahel¹ 


Post-translational modification of proteins by poly(ADP-ribosyl)ation 
regulates many cellular pathways that are critical for genome 
stability, including DNA repair, chromatin structure, mitosis and 
apoptosis¹. Poly(ADP-ribose) (PAR) is composed of repeating ADP-ribose 
units linked via a unique glycosidic ribose–ribose bond, and is 
synthesized from NAD by PAR polymerases¹,². PAR glycohydrolase 
(PARG) is the only protein capable of specific hydrolysis of the ribose–ribose 
bonds present in PAR chains; its deficiency leads to cell death³,⁴. 
Here we show that filamentous fungi and a number of bacteria possess 
a divergent form of PARG that has all the main characteristics of 
the human PARG enzyme. We present the first PARG crystal structure 
(derived from the bacterium Thermomonospora curvata), 
which reveals that the PARG catalytic domain is a distant member 
of the ubiquitous ADP-ribose-binding macrodomain family⁵,⁶. 
High-resolution structures of T. curvata PARG in complexes with 
ADP-ribose and the PARG inhibitor ADP-HPD, complemented by 
biochemical studies, allow us to propose a model for PAR binding 
and catalysis by PARG. The insights into the PARG structure and 
catalytic mechanism should greatly improve our understanding of 
how PARG activity controls reversible protein poly(ADP-ribosyl)ation 
and potentially of how the defects in this regulation are linked 
to human disease. 

Whereas the mechanism and functional aspects of PAR synthesis 
have been extensively characterized, the present understanding of the 
PAR-degradation pathways catalysed by PARG is comparatively poor. 
Until now, the catalytic fold and the mechanism of PARG-mediated 
hydrolysis have remained unknown. Our homology searches revealed 
that the majority of fungal genomes lack close mammalian/canonical 
PARG homologues, but instead possess a divergent PARG-like protein, 
annotated as DUF2263. DUF2263 protein sequences contain the PARG 
signature (GGG-X₆₋₈-QEE)⁷, which includes the previously identified 
key residues: two consecutive glutamates⁷ (Fig. 1a, black asterisk) and a 
glycine⁸ (Fig. 1a, grey asterisk). In addition, a tyrosine residue (Fig. 1a, 
black cross) implicated in binding to the PARG inhibitor ADP-HPD⁹ is 
also present in DUF2263. The DUF2263 orthologue is also found in 
other eukaryotes lacking canonical PARGs such as the rotifer Adineta 
vaga (Fig. 1b), where it is fused to several copies of the PAR-binding 
zinc finger¹⁰. As both filamentous fungi and A. vaga have functional 
PARP orthologues and an active PAR metabolism¹¹,¹², we postulated 
that DUF2263 was likely to be a functional PARG in these organisms. 
Surprisingly, although bacteria are historically thought to be devoid 
of PAR metabolism, certain bacterial species possess both PARP 
(closely related to PARP1; ref. 13) and DUF2263 genes, whereas some 
only contain the DUF2263 homologue (Fig. 1a, b). To analyse the 
biochemical activities of the corresponding bacterial PARP and 
PARG proteins, we purified His₆-tagged Herpetosiphon aurantiacus 
PARP and DUF2263. H. aurantiacus PARP was able to synthesize 
PAR from NAD, as revealed by antibodies recognizing PAR (Fig. 1c) 
and by the analysis of PAR ladders using sequencing gels (Supplementary 
Fig. 1). Analogous to the human PARP1 enzyme, H. aurantiacus 
PARP was sensitive to the PARP inhibitor KU-0058948 and required 
DNA for its activation (Fig. 1c). Notably, poly(ADP-ribosyl)ation 
produced by H. aurantiacus PARP is efficiently hydrolysed by the 
H. aurantiacus DUF2263 protein, demonstrating that DUF2263 is a 
functional PARG (Fig. 1c) and suggesting the presence of functional 
PAR metabolism in bacteria. 
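
A signature of this kind lends itself to a simple pattern search. The sketch below assumes the PARG signature reads GGG, then six to eight residues of any type, then QEE, and scans a protein sequence for it with a regular expression. The example sequences are made-up toy strings, not real PARG or DUF2263 sequences, and this is not the homology search used in the study.

```python
import re

# Assumed reading of the signature: GGG, 6-8 arbitrary residues, then QEE.
PARG_SIGNATURE = re.compile(r"GGG.{6,8}QEE")


def find_signature(sequence):
    """Return (start_index, matched_text) for each non-overlapping match."""
    seq = sequence.upper()
    return [(m.start(), m.group()) for m in PARG_SIGNATURE.finditer(seq)]


if __name__ == "__main__":
    toy_hit = "MKTGGGAAAAAAQEELV"   # toy sequence with a 6-residue spacer
    toy_miss = "MKTGGGAAAAAQEELV"   # toy sequence with a 5-residue spacer
    print(find_signature(toy_hit))
    print(find_signature(toy_miss))
```

A motif match of this sort only flags candidates; the functional assignment in the text rests on the biochemical assays, not on the pattern alone.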

We extended our analyses to other bacterial and fungal PARGs 
(Fig. 1a, b and Supplementary Fig. 2) using established PAR substrates 
formed by the activity of the well-characterized human PARP1 enzyme. 
Canonical human and Entamoeba dispar PARGs served as controls. 
The in vitro activity of the purified PARG enzymes was evaluated in five 
ways: (1) an assay that measures the loss of PAR from PARP1-modified 
histones; (2) a western-blot analysis of the hydrolysis of PAR from the 
auto-modified human PARP1 using anti-PAR antibodies; (3) an SDS–polyacrylamide 
gel electrophoresis (SDS–PAGE) analysis of the [³²P]-PAR 
auto-modified human PARP1; (4) an analysis of the release of 
ADP-ribose from PAR either by thin-layer chromatography (TLC); or 
(5) by liquid chromatography–mass spectrometry (LC/MS) (Fig. 2a–d 
and Supplementary Fig. 4). The quantification of PAR hydrolysis by the 
PARG assay shows that bacterial and fungal PARGs are highly active in 
hydrolysing PAR in vitro; their catalytic potency is comparable to the 
E. dispar and human PARG enzymes (Fig. 2a). Moreover, the mode of 
action of these proteins and mutant versions on poly(ADP-ribosyl)ated 
PARP1 shows the same pattern observed for the canonical PARGs. 
First, they are all inactivated by the mutation of a key glutamate residue 
(Fig. 2b, d) and by treatment with the PARG inhibitor ADP-HPD 
(Fig. 2c). Second, they are all active on ribose–ribose bonds between 
the two ADP-ribose units, but they are not efficient on mono(ADP-ribosyl)ated 
protein substrates with the ADP-ribose unit linked directly 
to the PARP1 (Fig. 2c) or the PARP10 protein (Supplementary Fig. 3). 
Last, TLC and LC/MS analyses confirmed that the main product of 
DUF2263-type PARGs is ADP-ribose (Fig. 2d and Supplementary 
Fig. 4). Collectively, these results demonstrate that bacterial and fungal 
PARG proteins possess genuine PARG activity in vitro. 

To test whether bacterial and fungal PARG proteins can act as 
PARGs in vivo, we used the Saccharomyces cerevisiae system expressing 
human PARP1 (ref. 14). Yeast cells, naturally deficient in PAR metabolism, 
can readily express human PARP1, resulting in poly(ADP-ribosyl)ation 
of various protein substrates¹⁴ (Fig. 2e). When divergent 
PARG proteins were co-expressed with human PARP1, we observed 
that the poly(ADP-ribosyl)ation levels in yeast extracts diminished, 
indicating an efficient hydrolysis of PAR in vivo (Fig. 2e and Supplementary 
Fig. 5). 

Although the first PARG gene was cloned in 1997 (ref. 15), until now 
there has been no structural data available for a PARG enzyme. To gain 
an insight into the PARG structure and mechanism, we solved the 


¹Cancer Research UK, Paterson Institute for Cancer Research, University of Manchester, Wilmslow Road, Manchester M20 4BX, UK. ²Université de Paris-Descartes, Faculté de Médecine, INSERM U1001, 
156 rue de Vaugirard, 75015 Paris, France. ³Manchester Interdisciplinary Biocentre, Princess Street 131, Manchester M1 7DN, UK. ⁴ICOA-UMR CNRS 6005, Université d’Orléans, Rue de Chartres, F-45067 
Orléans, France. ⁵Rudjer Boskovic Institute, Bijenicka 54, HR-10000 Zagreb, Croatia. 
*These authors contributed equally to this work. 


616 | NATURE | VOL 477 | 29 SEPTEMBER 2011 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


[Figure 1 image. a, sequence alignment of the Tcu, Hau, Dra, Afu, Hsa, Bta and Edi proteins; b, phylogenetic tree 
grouping canonical PARG and DUF2263 proteins; c, anti-PAR western blot of reactions containing HA PARP with or 
without activated DNA, PARP inhibitor, HA PARG and HA PARG E108A.] 

Figure 1 | Phylogeny and functional relationship between DUF2263 and 
canonical-type PARGs. a, Multiple sequence alignment of different DUF2263 
and PARG proteins from T. curvata (Tcu), H. aurantiacus (Hau), Deinococcus 
radiodurans (Dra), Aspergillus fumigatus (Afu), Homo sapiens (Hsa), Bos 
taurus (Bta) and E. dispar (Edi). The two catalytic glutamates, a conserved 
glycine and tyrosine are marked with black asterisks, a grey asterisk and a black 
cross, respectively. Secondary structural elements from the Tcu PARG 
structure are indicated above. b, YmdB-rooted phylogenetic tree of PARGs 
implied by the neighbour-joining method. Organisms devoid of PARP are 
marked in grey. c, H. aurantiacus (HA) PARP and PARG enzymes are active as 
shown by western blotting with anti-PAR antibodies. The positions of protein 
markers are indicated on the right margin (in kDa). 


T. curvata PARG ligand-free crystal structure to 1.5 Å in addition to 
ADP-ribose and ADP-HPD inhibitor-bound structures (Supplementary 
Table 1). Comparison with available structures reveals that 
T. curvata PARG consists of an ADP-ribose-binding macrodomain 
fold with an amino-terminal extension (residues 3–67; Fig. 3c). Similar 
macrodomain structures are found in the Escherichia coli YmdB 
protein (PDB accession 1SPV; Z score 16.2, root mean squared deviation 
(r.m.s.d.) 2.1 Å over 161 Cα atoms, 19% identity), an 
Archaeoglobus fulgidus macrodomain protein (PDB accession 2BFQ; 
Z score 15.6, r.m.s.d. 2.8 Å over 164 Cα atoms, 16% identity)⁵ and the 
histone variant MACROH2A1.1 (PDB accession 1ZR3; Z score 15.0, 
r.m.s.d. 2.3 Å over 157 Cα atoms, 12% identity)¹⁶. The diphosphate-binding 
loop that flanks one side of the ADP-ribose-binding cavity is 
highly conserved between PARG and other macrodomain structures 
(in magenta, Fig. 3a, c). In contrast, the opposite side of the PARG 
ADP-ribose-binding cavity is lined by a stretch of amino acids corresponding 
to the PARG-specific GGG-X₆₋₈-QEE signature sequence⁷ 
(in yellow, Fig. 3a–c). The structural alignment reveals that this PARG-specific 
loop is inserted into the macrodomain fold to accommodate 
the Glu 115 side chain that projects into the PARG active site (Fig. 3c, 
d). Owing to the PARG-specific loop, it seems that only PARGs but 
not other macrodomain proteins can hydrolyse PAR (Supplementary 
Fig. 6). 
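
The r.m.s.d. values quoted for the structural comparisons summarize how far matched Cα atoms lie from one another. As a minimal illustration of the quantity itself, the sketch below computes the r.m.s.d. between two equally sized lists of coordinates. It assumes the structures are already superposed and the atoms already paired, and it uses invented coordinates; it is not the alignment procedure behind the reported Z scores.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean squared deviation (same unit as the input, e.g. angstroms).

    `coords_a` and `coords_b` are equal-length sequences of (x, y, z) tuples
    for paired atoms; no superposition is performed here.
    """
    if len(coords_a) != len(coords_b):
        raise ValueError("coordinate lists must have the same length")
    if not coords_a:
        raise ValueError("coordinate lists must not be empty")
    squared = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared / len(coords_a))


if __name__ == "__main__":
    # Invented coordinates: every atom displaced by 2 units along z.
    model = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    shifted = [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0)]
    print(rmsd(model, shifted))
```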

The ligand–PARG complex structures reveal that ADP-α-ribose 
occupies a position similar to that observed in other ADP-ribose–macrodomain 
complex structures⁵ (Fig. 3d and Supplementary Fig. 7). 
A minor rearrangement can be observed for Val 226–Phe 227 upon 
ADP-ribose binding. The Phe 227 side chain is in close contact with 
the ribose″ moiety of ADP-ribose, ensuring a close distance between 
the ribose″ O4 and one of the phosphate groups within the constraints 
of the active site. The ribose″ moiety also forms hydrogen bonds with 
Glu 114 (via the 2′-OH group) and Glu 115 (via the 1′-OH group) 
(Fig. 3d). With the exception of water-mediated polar contacts, few 
direct contacts can be observed with the adenosine moiety. In contrast 
to the ribose″ moiety, the adenosine ribose′ moiety is markedly less 
accessible. The presence of any substituents on the ribose′ 2′-OH group 
(such as another ADP-ribose) would require a major reorientation of 
the PARG carboxy-terminal α-helix (see Fig. 4a) with concomitant 
exposure of the hydrophobic core of the macrodomain. This suggests 
that bacterial PARG binds the PAR terminus and acts as an exo-glycohydrolase. 

The steric restraints imposed by the PARG active site form a basis 
for the modelling of an α(1″→2′) O-glycosidic linkage with an additional 
ADP-ribose group to provide further insight into PAR binding 
(Fig. 4a, b). Our models suggest that the (n − 1) adenine moiety is 
partially enclosed by Ala 110, Ala 112 and Val 226 while the (n − 1) 
ribose′ is in close contact with Ser 98 and Gly 104 (Fig. 4b). However, 
molecular dynamics simulations suggest that the enzyme lacks any 
obvious binding sites for the (n − 1) phosphate and ribose″ moieties, 
as exemplified by the considerable conformational freedom of the 
latter with respect to the terminal ADP-ribose and the associated 
(n − 1) ribose′. This suggests that bacterial PARG does not specifically 
bind additional PAR elements. 

Figure 2 | PAR hydrolytic activities of divergent and canonical PARGs. a, A 
colorimetric PARG assay. b, Analysis of the hydrolysis of the PARP1-generated 
PAR substrate by anti-PAR antibodies. AF, A. fumigatus; AV, A. variabilis; DR, 
D. radiodurans; HA, H. aurantiacus; HS, H. sapiens; ED, E. dispar; TC, 
T. curvata. The positions of protein markers are indicated on the margins of 
panels b, c and e (in kDa). c, SDS–PAGE-based assay with [³²P]-automodified 
PARP1 substrate. PARGs are inhibited by ADP-HPD. d, TLC analysis of PARG 
activity on the [³²P]-PAR substrate. The right side of the TLC plate was 
visualized by shadowing under ultraviolet light. e, Heterologously expressed 
A. fumigatus and D. radiodurans PARG hydrolyse PAR in human PARP1-expressing 
budding yeast cells. 


The T. curvata PARG complex with ADP-ribose (Fig. 3d) and the 
corresponding PAR–PARG model (Fig. 4b) allow us to propose a 
mechanism for PARG catalysis (Fig. 4c). Binding of the PAR terminus 
positions the key ribose–ribose O-glycosidic linkage in direct hydrogen-bonding 
contact with Glu 115, while constraining the conformation of 
the terminal ribose″. Formation of a putative oxocarbenium intermediate 
is supported by the protonation of the (n − 1) PAR ribose′ 
2′-OH leaving group via Glu 115, and by the stabilization of the positively 
charged oxocarbenium through close proximity with the terminal 
diphosphate group. A water molecule is ideally positioned to 
attack the oxocarbenium intermediate, activated through concomitant 
deprotonation by Glu 115. This leads to the release of ADP-β-ribose″ 
and (n − 1) PAR. 

Figure 3 | T. curvata PARG crystal structure in complex with ADP-ribose. 
a, b, Overall fold of PARG with α-helices depicted in red, β-sheets in blue and 
bound ADP-ribose in green spheres. The PARG-specific catalytic loop is shown 
in yellow, and the diphosphate-binding loop in magenta. c, Overlay of the 
T. curvata PARG structure with a representative macrodomain structure (PDB 
accession 2BFQ, in cyan). The PARG-specific N-terminal extension and 
additional PARG structural elements are highlighted in red. Structural features 
that are similar to 2BFQ and other macrodomains are shown in grey. 
d, Detailed view of ADP-ribose bound in the PARG active site. ADP-ribose is 
shown with the corresponding 2Fo − Fc omit map density contoured at 1.2σ in 
blue. Key active site residues are represented in atom coloured sticks with 
hydrogen bonds indicated by black dotted lines. 

This mechanism is supported by mutagenesis and binding studies. 
Although mutations of both Glu 114 and Glu 115 render the enzyme 
inactive without disruption to the overall PARG fold (Fig. 4d and 
Supplementary Fig. 8), the isothermal titration calorimetry (ITC) 
ADP-ribose binding studies reveal that mutation of Glu 115 into an 
alanine does not affect binding, whereas the Glu114Ala mutation leads 
to an approximately tenfold decrease in binding affinity (Supplementary 
Fig. 9). This supports the notion that the role of Glu 114 is confined 
to substrate binding/orientation as opposed to the acid–base 
catalysis proposed for Glu 115. The mutation of Phe 227, implicated 
in positioning the terminal ribose″, also renders the enzyme inactive 
(Fig. 4d). Mutations of Ser 98 and Val 226, implicated by our model in 
binding the (n − 1) ADP-ribose, greatly diminish T. curvata PARG 
activity (Fig. 4d). Importantly, mutations of the corresponding catalytic 
residues in human PARG have a similar effect on the enzymatic 
activity, which suggests a universal catalytic mechanism for bacterial 
and mammalian PARGs (Fig. 4d). 
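
To put the tenfold loss of ADP-ribose binding affinity for Glu114Ala on an energy scale, the standard relation ΔΔG = RT ln(fold change) can be applied. The sketch below assumes a temperature of 298.15 K, which is not stated in the text, and treats "tenfold" as an exact ratio of dissociation constants; the function name is illustrative.

```python
import math

GAS_CONSTANT_J_PER_MOL_K = 8.314462618


def ddg_kj_per_mol(fold_change, temperature_k=298.15):
    """Change in binding free energy (kJ/mol) for a fold change in Kd.

    Uses ddG = R * T * ln(Kd_mutant / Kd_wild_type). The default temperature
    is an assumption, not a value taken from the ITC experiments.
    """
    if fold_change <= 0:
        raise ValueError("fold_change must be positive")
    if temperature_k <= 0:
        raise ValueError("temperature_k must be positive")
    return GAS_CONSTANT_J_PER_MOL_K * temperature_k * math.log(fold_change) / 1000.0


if __name__ == "__main__":
    print(f"{ddg_kj_per_mol(10.0):.1f} kJ/mol")
```

At the assumed temperature a tenfold change corresponds to roughly 5.7 kJ/mol, a modest penalty consistent with a role in substrate binding and orientation.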

These data provide the first insight into PARG structure and cata- 
lytic mechanism, and improve our understanding of the biochemical 
strategies regulating reversible poly(ADP-ribosyl)ation. Our results 
indicate the existence of bacterial PAR metabolism and show that 
PARG is a member of the widespread macrodomain protein family”®, 
which is known to be implicated in genome stability and regulated by 
poly(ADP-ribosyl)ation’””. Our data confirm that PARGs are evolu- 
tionary unrelated to ARH3 proteins, another class of enzymes with the 
ability to cleave PAR”®”’ (Supplementary Information). We believe 
that our findings will provide the groundwork for future studies that 
might ultimately lead to the development of small, cell-permeable 
PARG inhibitors and the potential to manipulate the physiology of 
health and disease by interfering with PAR metabolism. 


Figure 4 | Structural basis of PAR glycohydrolysis. a, The PARG 
solvent accessible surface derived from molecular dynamics 
simulations. b, A detailed view of the lowest energy PARG–ADP-ribose 
dimer model obtained from the 100 snapshot structures. Key active site 
residues are highlighted in green with PAR structural elements coloured as 
in a. c, Proposed mechanism for PAR glycohydrolysis. R₁ and R₂ represent 
(n − 1) PAR and terminal adenosine moieties respectively. d, PARG 
mutants and the corresponding H. sapiens PARG mutants. WT, wild type. 


METHODS SUMMARY 


Proteins with His₆-tags were purified from E. coli. A colorimetric PARG Assay 
Kit (Trevigen) was used to measure the loss of biotinylated PAR from histones. 
Auto-modified poly(ADP-ribosyl)ated human and bacterial PARP substrates 
were made by incubation with NAD”. In vivo activity of PARGs was tested in a 
yeast system with inducible expression of human PARP1. The T. curvata PARG 
structure was solved by means of single-wavelength anomalous diffraction on Pt- 
soaked crystals. Ligand-enzyme complexes were obtained by the soaking of PARG 
crystals. The PAR–PARG model was created using molecular dynamics simulations 
of the ADP-α-ribose dimer in complex with PARG. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Plasmids and proteins. Bacterial and fungal PARG proteins and the E. dispar 
PARG were expressed from the pET28a vector (Novagen). Human PARG with 
an N-terminal truncation (Δ1–455) was expressed from the pColdTF vector 
(Takara). The A. fumigatus Af293 gene (AFUA_4G11940), the D. radiodurans 
R1 ATCC13939 gene (DR_B0099) and the E. dispar SAW760 gene (EDI_110590) 
were amplified from genomic DNA, the human PARP1 and PARG genes 
(Q86W56) were amplified from HeLa cDNA, while the T. curvata DSM 43183 gene 
(Tcur_1721) and the A. variabilis ATCC 29413 gene (Ava_4013) were synthesized 
according to the database sequence (GenScript USA). The H. aurantiacus ATCC 
23779 PARG (Haur_1618) and PARP (Haur_4763) genes were amplified from 
genomic DNA extracted from a dried culture purchased from DSMZ. The codes 
in brackets indicate gene accession numbers. All proteins bear an N-terminal His 
tag with the exception of the A. variabilis PARG, which has a C-terminal His tag. 
Mutations were introduced using the QuikChange II site-directed mutagenesis kit 
(Stratagene). Proteins were expressed in E. coli Rosetta2(DE3) cells (Novagen). 
Recombinant proteins were purified on Ni-NTA beads according to standard 
procedure. For crystallization studies, the T. curvata PARG was purified by 
FPLC on a HisTrap HP column followed by Superdex 200 (GE Healthcare). The 
catalytic fragment of human PARP10 (amino acids 818–1025) was expressed as a 
GST-tagged protein and purified on glutathione sepharose columns. 

PARP activity assay. PARP automodification activity of the recombinant 
H. aurantiacus PARP was assayed in a 30-min room-temperature reaction con- 
taining 100nM H. aurantiacus PARP, 200 1M NAD (Trevigen), 50 mM Tris pH 
7.5 and 50mM NaCl with or without activated DNA (Trevigen), and with or 
without the PARP inhibitor KU-0058948 (10,1M). After the reactions were 
stopped with the PARP inhibitor, 500 nM H. aurantiacus PARG and its catalytic 
glutamate mutants were added to the reaction for 30 min. The reactions were 
analysed by western blotting with rabbit anti-PAR antibodies as described later. 

PARG activity assays. PARG activity of recombinant proteins was quantified 
with a colorimetric PARG assay kit (Trevigen), which measured the loss of 
biotinylated PAR from histones generated by PARP1. PARG enzymes were used 
at 3 nM for this assay. For western-blot analysis of PARG activity, PAR was 
synthesized by the automodification of PARP1 in a reaction mixture containing 2 units of 
PARP1 (Trevigen), 200 µM NAD (Trevigen), activated DNA (Trevigen), 50 mM 
Tris pH 7.5 and 50 mM NaCl at 20°C. Reactions were stopped after 30 min by the 
addition of the PARP inhibitor KU-0058948. 500 nM PARGs were added to the 
reaction and incubated for another 30 min. The reactions were run on 4-12% 
SDS-PAGE gels and blotted onto a nitrocellulose membrane. PAR hydrolysis 
was detected by rabbit polyclonal anti-PAR antibodies (1:1,000; Trevigen). 
Western blots were analysed densitometrically by GeneTools (SynGene). For a 
direct visualization of PARG activity via SDS-PAGE, [³²P]-labelled NAD (GE 
Healthcare) was added to the above reaction to generate radioactively labelled 
poly(ADP-ribosyl)ated or mono(ADP-ribosyl)ated PARP substrates. PARGs 
(500 nM) were subsequently added with or without the PARG inhibitor 
ADP-HPD (50 µM). The reaction products were analysed by SDS-PAGE and visualized 
by autoradiography. For TLC analysis, the reactions were stopped by the addition 
of 50 mM ADP-ribose, spotted onto polyethyleneimine (PEI)-cellulose plates 
(Macherey-Nagel, Polygram CEL 300 PEI/UV254) and developed in 0.15 M LiCl 
and 0.15 M formic acid. Dried plates were exposed on X-ray film or visualized by 
UV254 shadowing. 

Expression of heterologous proteins in S. cerevisiae. Plasmids used to introduce 
human PARP1, A. fumigatus PARG and D. radiodurans PARG into S. cerevisiae 
were created via the Gateway (Invitrogen) site-specific recombination system using 
yeast-specific destination vectors (Addgene). Human PARP1 was introduced into 
pAG303GAL-ccdB (HIS3), while A. fumigatus PARG and D. radiodurans PARG 
were introduced into pAG423GAL-ccdB (URA3). The host strain W303-1a 
(MATa, ade2-1, ura3-1, his3-11,15, trp1-1, leu2-3,112, can1-100) was grown in rich 
YPD medium at 30 °C. The strain was first transformed with the human 
PARP1-harbouring plasmid, resulting in His⁺ transformants with chromosomally 
integrated human PARP1. Subsequent transformation with the PARG-harbouring 
plasmids yielded His⁺ Ura⁺ transformants with episomally maintained PARG. 
To induce the expression of PARP1 and PARG from the GAL1 promoter, the 
strains were grown overnight in minimal SC medium with 2% glucose. The 
overnight cultures were washed, resuspended in SC+2% raffinose at 7 × 10° cells ml⁻¹ 
and grown for one cell generation. Cells were centrifuged before resuspending in 
SC+2% galactose medium and grown overnight to induce the expression of human 
PARP1 and PARG from the GAL1 promoter. Yeast protein extracts were analysed 
by immunoblotting. Rabbit polyclonal anti-PAR antibody (Trevigen) and rabbit 
anti-PARP1 antibody (Abcam) were used at 1:1,000 dilution, while rat monoclonal 
anti--tubulin antibody (Abcam) was used at 1:5,000 dilution. Rabbit anti-D. radio- 
durans PARG and anti-A. fumigatus PARG polyclonal antibodies were raised 
against the respective full-length proteins purified from E. coli (Eurogentec) and 
applied at 1:1,000 dilution. 

Crystallization, refinement and model building. Initial T. curvata PARG 
crystallization conditions were identified using the JCSG+ matrix screen (Molecular 
Dimensions). Crystals suitable for diffraction experiments were obtained by 
sitting-drop vapour diffusion in 4 µl drops containing equal volumes of protein 
and a solution containing 10% PEG 6K and 0.1 M Bicine buffer (pH 9.0). The 
crystals were derivatised by soaking with the same solution supplemented with 
1 mM potassium tetracyanoplatinate (II) trihydrate for 10 min. For the 
ADP-ribose and ADP-HPD complexes, native crystals were soaked with 1 mM ligand 
for 1-2 h. The crystals were cryoprotected with the addition of 15% PEG 200 to the 
mother liquor and flash cooled in liquid nitrogen. Data were collected on beamline 
I04 at the Diamond Light Source Facility and reduced and scaled with the X-ray 
Detector Software suite (XDS). 

The ligand-free PARG crystal structure was determined by single-wavelength 
anomalous diffraction on a Pt-soaked crystal using Solve and Resolve as 
implemented in Phenix. The initial electron density map was of sufficient quality for 
Resolve to build 70% of the model. Further automated model building with 
ARP/wARP successfully constructed the entire model, with the exception of a 
small loop region incorporating residues 58-66, for which no corresponding 
density could be observed. The model was completed by iterative cycles of manual 
model building and real space refinement using the program Coot and 
crystallographic refinement using Phenix.refine. Validation of the structure was 
performed with Molprobity. The processing, phasing and final refinement statistics 
are presented in Supplementary Table 1. 

ITC. The thermodynamic ligand binding properties of wild-type T. curvata and 
two mutant proteins (E114A and E115A) were measured using a VP-ITC 
microcalorimeter. Protein and ligand concentrations were 20 µM and 200 µM, 
respectively, in 50 mM Tris (pH 7.5), 100 mM NaCl. Titration curves were fitted using a 
nonlinear least squares method in Microcal Origin software. A model that was 
indicative of a single binding site was found to give the best fit in each case; this 
model was used to obtain the thermodynamic parameters shown in Supplemen- 
tary Fig. 9. 

Thermal shift assay. In each 50 µl reaction, 15 µl of 300× Sypro Orange (Sigma), 
2 µl of water and 33 µl of protein (4 mg ml⁻¹) in 25 mM Tris pH 7.5, 50 mM NaCl, 
10% glycerol and 1 mM DTT were added to the wells of a 96-well thin-wall PCR 
plate (Bio-Rad). For thermal shift assays in the presence of ADP-ribose, 2 µl of 
25 mM ADP-ribose was added instead of water. Samples were placed in a CFX96 
Real Time PCR thermal cycler (Bio-Rad) and slowly heated from 15 to 95 °C with 
sample fluorescence recorded every 0.2 °C. Fluorescence was monitored at 575 nm 
(emission), using 490 nm for excitation. 
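
As a quick check of the pipetting scheme above, the final concentrations in each well follow from simple dilution. The sketch below is illustrative only: the function and variable names are hypothetical, and it assumes the three listed volumes are the only components of the 50 µl reaction.

```python
def final_concentration(stock, added_volume_ul, total_volume_ul):
    """Concentration after diluting a stock into the total reaction volume."""
    return stock * added_volume_ul / total_volume_ul


TOTAL_UL = 50.0

# Volumes stated in the protocol (microlitres).
volumes_ul = {"sypro_orange": 15.0, "water_or_adp_ribose": 2.0, "protein": 33.0}

# The three components should account for the whole reaction volume.
total_pipetted_ul = sum(volumes_ul.values())

dye_fold = final_concentration(300.0, volumes_ul["sypro_orange"], TOTAL_UL)
protein_mg_ml = final_concentration(4.0, volumes_ul["protein"], TOTAL_UL)
adp_ribose_mm = final_concentration(25.0, volumes_ul["water_or_adp_ribose"], TOTAL_UL)

print(f"Sypro Orange: {dye_fold:.0f}x")
print(f"Protein: {protein_mg_ml:.2f} mg/ml")
print(f"ADP-ribose: {adp_ribose_mm:.1f} mM")
```

Under these assumptions the dye ends up at 90×, the protein at 2.64 mg ml⁻¹ and, where present, ADP-ribose at 1 mM.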

Analysis of PAR chains by sequencing gels. [³²P]-labelled PAR chains produced 
by PARPs were resolved in 10% sequencing gels in TBE buffer. Reactions were 
stopped by the addition of 600 µl 20% ice-cold trichloroacetic acid (TCA). The 
precipitated material was washed twice with 5% TCA. The pellet was then 
incubated with 70 µl of 10 mM Tris and 1 mM EDTA buffer (pH 12) for 2 h at 60 °C to 
detach PAR chains. PAR was phenol-chloroform extracted, dried and 
resuspended in 2 mM EDTA-formamide buffer before loading. The products were 
visualized by autoradiography. 

Computational simulations. NAMD software was used to perform all molecular 
dynamics simulations of the ADP-α-ribose dimer in complex with PARG. The 
topology and parameters files for the ligand were obtained using the Antechamber 
program and AM1-BCC charges. The additional ADP-α-ribose monomer was 
added to the ADP-α-ribose bound to PARG by creating an α(1″→2′) O-glycosidic 
bond. Five conformers of the added monomer were taken as starting points for 
molecular dynamics simulations. The PARG-ADP-α-ribose dimer complex model 
was then immersed in a periodic water box (TIP3), as half of the ligand was in 
contact with the bulk solvent. Then, for each conformer, several simulations 
(0.5 ns) were run with variable parameters (200 K or 310 K, peptidic backbone 
fixed, restrained or free). During these MD simulations, snapshots (100 in total for 
clarity) were periodically extracted and energy-minimized, to represent the 
conformational flexibility of the ADP-α-ribose dimer. 

Analyses by UHPLC coupled to QTOFMS. Ultrahigh-performance liquid chro- 
matography (UHPLC) coupled to quadrupole-time-of-flight mass spectrometry 
(QTOFMS) was used to analyse the nucleotide products of PARG enzymes. The 
poly(ADP-ribosyl)ated PARP1 was treated with T. curvata PARG, and the mixture 
was filtered using centricons (30 kD cut-off). The analysis of the filtrate was 
performed using a modification of a previously described procedure, which used ion-pair 
chromatography for the separation of nucleotides and negative electrospray 
ionization for mass spectrometric detection. All analyses were performed using 
a Waters Acquity UPLC system (Waters Corp.), equipped with a binary solvent 
delivery system and autosampler. The chromatographic separations used a 
column (50 mm × 2.1 mm) filled with a 1.7 µm BEH C18 stationary phase 
(Waters Corp.). Binary gradients at a flow rate of 0.4 ml min⁻¹ were applied for the 
elution. The eluent A was water containing 5 mmol l⁻¹ of pentylamine, while the 
pH value was adjusted to 6.5 using acetic acid. A fast elution gradient was applied, 
which allowed an efficient separation of NAD+ and ADP-ribose within 5 min. 
The gradient started at 2% B and then the percentage of B linearly increased to 25% 
in 5 min. 
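
The gradient described above can be written as a simple linear ramp. The following is a minimal sketch, not the instrument method itself: the function names are hypothetical, and holding the composition constant outside the 0 to 5 min window is an assumption made only so the function is defined everywhere.

```python
def percent_b(t_min, start=2.0, end=25.0, duration_min=5.0):
    """Percentage of eluent B at time t_min for a linear gradient.

    Assumption: the composition is held at `start` before t = 0 and at
    `end` after the ramp finishes.
    """
    if t_min <= 0.0:
        return start
    if t_min >= duration_min:
        return end
    return start + (end - start) * t_min / duration_min


def delivered_volume_ml(duration_min=5.0, flow_ml_min=0.4):
    """Total solvent volume delivered over the gradient at constant flow."""
    return duration_min * flow_ml_min


for t in (0.0, 2.5, 5.0):
    print(f"t = {t:.1f} min: {percent_b(t):.1f}% B")
print(f"Volume delivered: {delivered_volume_ml():.1f} ml")
```

With the stated values, the ramp passes through 13.5% B at the midpoint and delivers 2.0 ml of solvent in total.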

The mass spectrometry was performed on a QTOF Premier instrument (Waters 
Micromass) using an orthogonal Z-spray-electrospray interface. The drying gas 
and the nebulizing gas were nitrogen. The desolvation gas flow was set to 700 l h⁻¹ 
at a temperature of 300 °C. The cone gas flow was adjusted to 25 l h⁻¹, and the 
source temperature to 120 °C. The capillary and cone voltages were 3,200 V and 
30 V, respectively. The instrument was operated in V mode with TOFMS data 
being collected between m/z 100-1,000, applying a collision energy of 4 eV. All 
spectra were recorded using the extended dynamic range (DRE) option in order to 
correct for possible peak saturations. The data were collected in the centroid mode 
with a scan time of 0.08 s and interscan time of 0.02 s. In order to ensure maximum 
accuracy and reproducibility of the system, all acquisitions were carried out using 
an independent reference spray via the lock spray interface. Leucine enkephalin 
was applied as a lock mass in negative ionization mode (m/z 554.2615). 

The chromatograms, recorded in the total ion current (TIC) mode, were 
systematically examined by manually generating mass spectra of each visible 
individual peak using the background-subtraction option and searching for target 
components expected to occur in the mixture, in particular ADP-ribose and 
NAD+, using extracted ion chromatograms. The target analyses were based on 
the accurate mass feature of the instrument, applying a mass window of 50 mDa. 


Elemental composition of the selected ions was calculated using the MassLynx 
software incorporated in the instrument. The main criterion for selecting the 
elements to be included in search parameters for the calculation of the molecular 
formula was the elemental composition of the presumed parent compounds. 
According to the instrument specifications, the acceptable deviation from the 
theoretical m/z values was set at 5 mDa. 
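
The two mass criteria mentioned in this section (a 50 mDa window for extracted ion chromatograms and a 5 mDa acceptable deviation for formula assignment) can be expressed as simple checks. This is a hedged sketch with hypothetical function names. It assumes the 50 mDa window is a total width centred on the target m/z, which the text does not state. The lock-mass value 554.2615 is taken from the text and used purely as an example target.

```python
def mass_error_mda(observed_mz, theoretical_mz):
    """Signed deviation of an observed m/z from theory, in millidaltons."""
    return (observed_mz - theoretical_mz) * 1000.0


def within_tolerance(observed_mz, theoretical_mz, tol_mda=5.0):
    """True if the observed m/z lies within the accepted deviation."""
    return abs(mass_error_mda(observed_mz, theoretical_mz)) <= tol_mda


def in_extraction_window(mz, target_mz, window_mda=50.0):
    """True if mz falls inside the extracted-ion window.

    Assumption: window_mda is the full width, centred on target_mz.
    """
    return abs(mass_error_mda(mz, target_mz)) <= window_mda / 2.0


LOCK_MASS = 554.2615  # leucine enkephalin, negative mode (from the text)

print(within_tolerance(554.2650, LOCK_MASS))      # 3.5 mDa off
print(within_tolerance(554.2700, LOCK_MASS))      # 8.5 mDa off
print(in_extraction_window(554.2800, LOCK_MASS))  # 18.5 mDa off
```

If the window were instead meant as ±50 mDa, the comparison in `in_extraction_window` would use `window_mda` rather than half of it.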
Online videos can show researchers details of a technique that they could never glean from written descriptions in journal articles. 


LEARNING TOOLS 


Visual aids 


Internet-based tools and videos are making it easier to perfect lab techniques and tasks. 
But they augment, rather than replace, conventional guidance in person. 


BY KATHARINE SANDERSON 


During her PhD, neuroscientist Maria Toledo-Rodriguez needed to use 
the single-cell reverse-transcription 
polymerase chain reaction, a specialized tech- 
nique that combines electrophysiology with 
molecular biology. But only a handful of labs 
in the world had experts in this technique, and 
she could not manage to visit them. Toledo- 
Rodriguez, who is now an academic fellow at 
the University of Nottingham, UK, spent two 
years struggling to master the technique by 
following written instructions. 

She wishes that she'd been able to take 
advantage of a now-flourishing type of tute- 
lage: watching a video of someone performing 
the technique. “There are very stupid things 
like when you remove the pipette, in order to 
avoid bursting the neuron, you have to give it 
a little kick,” she says. This is not the kind of 
detail that researchers generally put in a meth- 
ods paper — and Toledo-Rodriguez says that 
seeing the technique in action gives a much 
better feel for the nuances than reading a 
description of it. 

Videos that demonstrate how to run experi- 
ments, including all those ‘stupid’ little tricks, 
are appearing online. They could save time 
and money for researchers, and allow them to 
master techniques outside of their subfield. 


THE TACIT TOUCH 

Mentors and experienced hands aren't always 
able to offer the time and attention that it 
takes to teach a ‘technique rookie’ the nuances 
and tacit skills that make hands-on science 
smoother, so videos are becoming a major 
asset to those hungry for an expanded labora- 
tory skill set. Lab mates and collaborators, with 
their capacity for relating the feel of a success- 
fully executed technique, can never be replaced 
entirely, but film clips go a long way. 

Video protocols can be especially helpful 
when a researcher is switching fields, or delv- 
ing into new research territory. For example, 
when Rachel Schecter moved from research- 
ing yeast as an undergraduate to examining the 
neuroscience of autism as a PhD student at the 
Massachusetts Institute of Technology (MIT) 
in Cambridge, she had to learn a whole new set 
of practical techniques. “I found it especially 
hard when I was starting out because the peo- 
ple in the lab who can teach you these things 
are often the busiest with their own stuff,” says 
Schecter. “So you really have to maximize the 
time they have to offer you.” 

Schecter became a fan of benchfly.com, a 
website that collects and organizes videos of 
laboratory techniques. The clips proved useful 
as tutorials, or as introductions to a method 
before she worked on it in person with someone 
from the lab. They cover everything 
from basic practical techniques, such as 
how to work in a sterile hood or use a pipette, 
to more technical procedures, including mak- 
ing primary neuron cultures, implanting elec- 
trodes or running a two-photon microscope. 
The videos help Schecter to get the “best use 
of the mentoring in the lab”, she says. And they 
allow her to double-check her technique after 
her initial training. “I find reading journal arti- 
cle methods sections incredibly frustrating. 
They don’t have the detail,” says Schecter, lik- 
ening the experience to following a cake recipe 
that says only to mix the ingredients, without 
describing exactly how much mixing to do or 
how vigorous it should be. 

BenchFly’s videos also offer tips on prob- 
lems facing lab newcomers, including the most 
common reason that a centrifuge won't switch 
on (a switch that starts the centrifuge only after 
a delay gets flipped accidentally). The site is 
organized by discipline and technique. And 
each video has a comment facility. As on sites 
such as YouTube, videos can be uploaded by 
anyone, from academics to students and lab- 
equipment companies. 

Alan Marnett founded BenchFly after a 
frustrating experience as a chemistry doc- 
toral student at the University of California, 
San Francisco. “I constantly changed fields,” 
says Marnett. “I was faced at every step with 
the challenges of a new environment.” After 
his PhD, he pursued a postdoc at the Picower 
Institute for Learning and Memory at MIT, 
studying how neurons grow. As he tried to 
bone up on new techniques, he found that he 
received inconsistent training. “It is only as 
good as the people 
around you,” he says. 
Marnett launched 
BenchFly in July 2009. 
The site sustains itself 
with money earned 
through advertising 
and video production 
services. 

Video tutorials save not only time and effort from mentors, but also money 
and even travel, as Moshe Pritsker realized. As a PhD student working on 
stem cells at Princeton University in New Jersey, Pritsker was asked to 
recreate a method of culturing embryonic stem 
cells that had been reported by researchers 
in Edinburgh, UK. “I tried to follow the steps 
in an article,” he says. But try as he might, the 
experiment wouldn't work. 

His supervisor — keen to have someone in 
the lab who could reliably do the technique 
— flew Pritsker to Edinburgh for a two-week 
reconnaissance and learning mission with 
the group that had invented it. On the return 
flight, Pritsker started to question the neces- 
sity of what had just happened — flying across 
the Atlantic Ocean to watch someone else do 
an experiment. “This is not the twenty-first 
century, this is the Stone Age,” Pritsker recalls 
thinking. 

His solution was to “bring the ‘show me’ 
effect” to scientists through videos of experi- 
ments. But to sell the concept, he decided to 
present the clips in the form of a peer-reviewed 
journal. “I realized we needed an incentive for 
scientists to watch and make videos,” he says. 
The result was the Journal of Visualized Experi- 
ments (JOVE), which Pritsker created in 2006 
after finishing a postdoc at Harvard Medical 
School in Boston, Massachusetts. 

Whereas BenchFly tends to show basic tech- 
niques, or small tricks to get something going, 
JOVE demonstrates detailed, specific and 
recently developed protocols. And in contrast 
to BenchFly’s do-it-yourself approach, JOVE’s 
videos are filmed and produced by an inter- 
national network of trained camera operators 
and producers. Production takes place after a 
text account of the video's technique is peer- 
reviewed and accepted for publication. Learn- 
ing the craft and technical aspects of science is 
what takes up most of a lab’s time and money, 
says Pritsker — and videos can help to curb 
such excesses. 

They may also be able to facilitate student 
comprehension. Videos are a great resource for 
exploring the finer technical points of a topic, 
says Alice Rushforth, a programme manager 
at the Center for Emergent Behaviors of Inte- 
grated Cellular Systems (EBICS) at MIT. She 
used BenchFly to teach undergraduates in her 
previous job as an MIT biology instructor. 
Rushforth notes that it’s not always possible to 
show every student every technique individu- 
ally — it’s both expensive and time consuming, 
and videos can fill the gap. 

Rushforth hopes to use BenchFly to make a 
package of videos for EBICS summer-school 
students. Toledo-Rodriguez notes that videos 
are especially helpful for scientists who teach 
general university courses involving tech- 
niques that aren't necessarily within their area 
of expertise. 


IN LIVING COLOUR 
Of course, videos have their limitations. 
“There is no better way to learn a new tech- 
nique than to stand beside an experienced 
practitioner and watch them work”, says Chris 
Surridge, editor of Nature Protocols. And it’s 
hard to replicate the benefits of a colleague's 
scrutiny and correction. Toledo-Rodriguez 
has used JOVE videos to learn methods for 
dealing with stem cells, but says that some- 
times it is not enough. “The videos go very 
fast, so for little details such as solution con- 
centrations, you need the numbers, and it’s 
always better to have the paper,” she says. 
And then there's the possibility that the 
video protocol will be wrong. Marnett says 
that BenchFly does exert quality control, 
trying to make sure that erroneous techniques 
don't slip through. “We will take a 
video down only if it incorrectly demonstrates 
a technique or if we feel it’s unnecessarily 
dangerous,” he says. So far, there 
have been no such cases. But there are 
often several correct ways of performing 
any given technique, so BenchFly must 
be careful not to be too strict about 
what it allows. “If a video is posted that 
isn’t exactly how we would do it, but it is 
scientifically sound, then we would certainly 
leave it there,” says Marnett. 

Videos that accompany peer-reviewed 
articles are likely to 
be more accurate than those that are posted 
with minimal quality control, and some print 
journals are starting to request more videos. 
Nature Protocols has a section on its website 
(go.nature.com/bcjwzs) that hosts videos 
included with the supplementary information 
accompanying papers — but such submissions 
remain rare, says Surridge. 

The journal also has a YouTube channel 
(go.nature.com/alxr4h) dedicated to videos 
from papers or from the protocol exchange, 
an open-access repository run by Nature Pro- 
tocols. Scientists can use the resource to share 
their own tips and videos of techniques: they 
sign up to be part of an online ‘lab group’; then 
they can post protocols, which must follow 
a required format and aren't necessarily vid- 
eos. The chemistry journal Angewandte Che- 
mie also allows researchers to submit video 
supplements. 

But video supplements can be difficult to 
find, as they’re not usually collated in a cen- 
tral location on journal websites, and they 
don’t necessarily show experimental tech- 
niques; rather, they might demonstrate a col- 
our change in a chemical reaction, or another 
visual result of the study with which they are 
associated. 

For an increasing number of researchers, 
however, videos provide a way to get up to 
speed in a new field and can supplement 
the unpredictable technical know-how of 
lab mates and supervisors. “Becoming a 
great scientist,” says Marnett, “is just as 
much about learning the tips and tricks that 
increase quality and reproducibility of data 
as it is about learning the science behind the 
experiments.” 


Katharine Sanderson is a freelance writer 
based in Toulouse, France. 

WOMANSPACE 


BY ED RYBICKI 


You'd never think that all it took was 
two middle-aged men, sent 
shopping by the wife of one 
to buy knickers, to crack the biggest 
discovery in modern physics. 

It was very simple: I'd been 
staying with my friend Russell in 
Canberra, trying to sort out how 
we were going to get our book on 
virus structure together, when 
Russell’s wife Lilia decided that 
their youngest daughter needed 
new school knickers. She was too 
busy making supper to bother; 
these otherwise unemployed elderly 
men were the perfect candidates — 
and the prospect of not having to 
listen to us blather on about just where 
to pitch the book, and what to put in it, and 
which Jethro Tull albums we liked, prob- 
ably tipped the balance our way. Seeing as 
we could continue to do all those things in a 
car and in the supermarket — and do a side 
trip to drool over new electronic goodies in 
Harvey Norman — we agreed with alacrity. 

And so it was, that after a most satisfying 
comparison shop of iMacs versus the rest, 
and a cruise through rock nostalgia in the 
shape of special-offer CDs, we found our- 
selves in a large supermarket, trawling for 
girls’ knickers. 

At this point I must digress, and mention, 
for those who are not aware, the profound 
differences in strategy between Men Going 
Shopping and Women Going Shopping. In 
any general shopping situation, men hunt: 
that is, they go into a complex environment 
with a few clear objectives, achieve those, 
and leave. Women, on the other hand, 
gather: such that any mission to buy just 
bread and milk could turn into an extended 
foraging expedition that also snares a to-die- 
for pair of discounted shoes; a useful new 
mop; three sorts of new cook-in sauces; and 
possibly a selection of frozen fish. 

And the interesting thing is — and this is 
what sparked the discovery — that any male 
would be very hard pressed to say where she 
got some of these things, even if he accom- 
panied her. 

Have you never had the experience of 
talking to your significant female other as 
you wend your way through the complexity 
of a supermarket — only to suddenly find 
her 20 metres away with her back to you? 
And then she comes back with something 
you've never seen before, and tosses it in the 
trolley as if nothing has happened? 

I know I have — and until recently, I had 
always assumed it was just me not noticing 
new things in the aisles we were walking 
through. 

So there we were, looking for knick- 
ers, and a rather wary woman asked if she 
could help, given that we looked lost and 
hopeless. Russell explained to her exactly 
what we were looking for, and her wariness 
seemed to become mild alarm, until we has- 
tened to reassure her that this was in fact a 
commission for the mother of said child. 
She then said, with what seemed to be great 
satisfaction, “Oh, no; you'll never find those 
in here — you'll have to go down to [some 
remote location],” which we had no chance 
of achieving before they closed, so the whole 
mission was now a failure. 

It was as we trudged our forlorn way back 
to the car that Russell said: “You know, I’m 
sure we've found them there before — at 
least, Lilia has.” 

I said, only half-joking: “Well, women 
seem to be able to do that — maybe they're 
getting into spaces we poor guys can't?” 

That was the catalyst: suddenly, we Hunt- 
ers had an insight into how real Gatherers 
operated, sparked by our own hopeless- 
ness and some considerable acquaintance 
with the formidable talents of wives when it 
comes to finding things, and enough science 

(and science fiction) 
number of important questions. Includ- 
ing, it seemed, supermarket shopping. 
Oh, we were roundly scorned when 
we got back, knickerless; both for 
the lack of same, and for our absurd 
idea — which smacked of desperate 
snatching at straws to excuse incom- 
petence, to the astrophysics-quali- 
fied wife. However, we persevered: 
we talked it through exhaustively, 
even conquering problems like: 
“What happens when you take 
some item from another universe 
to the checkout — when they 
won't recognize the barcode?” 
Except they do — electronic sys- 
tems are quantum-computing 
devices in their ability to access 
stock codes, and when they don't, 
then it’s invariably a woman who gets sent 
to look for them, and they, of course, find 
them... 

And there it might have sat, had it not 
been for the Internet. We simply put the idea 
up in as many forums as we could access; 
we blogged on it; we talked to everyone we 
knew (well, male, obviously) who could be 
relied on to observe such phenomena — and 
slowly, the observations came in. 

Then, of course, there was arguing about 
significance, and power of the statistical 
methods used, and it all got usurped by 
some theoretical physicists the moment it 
started to look as though there was some- 
thing in it. 

But the answer is clear: women can access 
parallel universes in order to find things, 
whether they do it consciously or not. They 
have probably always been able to do this, 
and now there is fierce speculation as to 
whether this constituted the evolutionary 
advantage we had over other primates: the 
presence of bulbs, grains and nuts on the 
table that had been retrieved from paral- 
lel universes when the hunters came home 
empty-handed was probably a major factor 
in the survival of our species. 

The difference is that now they know that 
they can do it — and things have changed. 

Because groceries aren't all they go look- 
ing for. It turns out the next item on the 
shopping list is better-looking versions of us. 

Russell and I no longer communicate. 
And we're very lonely. 


Ed Rybicki swears that at least some of 
this is a true story. The part about Russell 
Kightley and the knickers, anyway. 

Detectable radio flares following gravitational waves 
from mergers of binary neutron stars 


Ehud Nakar¹ & Tsvi Piran² 


Mergers of neutron-star/neutron-star binaries are strong sources of 
gravitational waves. They can also launch subrelativistic and mildly 
relativistic outflows and are often assumed to be the sources of short 
γ-ray bursts. An electromagnetic signature that persisted for weeks to 
months after the event would strengthen any future claim of a detection 
of gravitational waves. Here we present results of calculations 
showing that the interaction of mildly relativistic outflows with the 
surrounding medium produces radio flares with peak emission at 1.4 
gigahertz that persist at detectable (submillijansky) levels for weeks, 
out to a redshift of 0.1. Slower subrelativistic outflows produce flares 
detectable for years at 150 megahertz, as well as at 1.4 gigahertz, from 
slightly shorter distances. The radio transient RT 19870422 (ref. 11) 
has the properties predicted by our model, and its most probable 
origin is the merger of a compact neutron-star/neutron-star binary. 
The lack of radio detections usually associated with short γ-ray bursts 
does not constrain the radio transients that we discuss here (from 
mildly relativistic and subrelativistic outflows) because short γ-ray 
burst redshifts are typically >0.1 and the appropriate timescales 
(longer than weeks) have not been sampled. 

Gravitational-wave detectors, and in particular the advanced LIGO 
and Virgo interferometers, are being constructed now with the goal of 
detecting gravitational waves from binary neutron-star coalescence at 
distances up to a few hundred megaparsecs (redshift z ≈ 0.1). The 
detection of an accompanying electromagnetic signal would complement 
these efforts, providing an independent confirmation of the 
discovery and increasing the detectors’ effective sensitivity. The search 
for such an electromagnetic signal has therefore attracted much interest. 
The radioactive decay of ejected debris from the merger would drive a 
short-lived supernova-like event. For example, ejection of 0.01 solar 
masses (0.01M⊙) from a merger at a distance of 300 Mpc would result 
in a faint optical flare that peaks after ~1 day (ref. 14). Finding, and 
especially identifying, such rare and faint events in the crowded variable 
optical sky is an extremely challenging task. Other authors have speculated 
on the production of low-frequency radio signals from the interaction 
of the neutron stars’ magnetic fields. These attempts focused 
on electromagnetic signals that are contemporaneous with, or follow 
quickly, the merger and the gravitational waves. Unfortunately, these 
predictions are highly uncertain. 

Here we predict a robust radio signal that peaks several weeks after 
the merger. Numerical simulations show that compact binary mergers 
launch energetic subrelativistic and mildly relativistic outflows. 
Ejection sources include unbound tidal tails, and winds driven by 
neutrino heating, nucleosynthesis and electromagnetic processes, 
emerging from the proto-neutron star or from an accretion disk. 
Overall, almost all merger models find a significant ejection of mass 
and energy. In binary neutron-star mergers, an ejection of about 
10°° erg at (0.1-0.2)c (where c is the speed of light) and about 
10” erg as faster ejecta is a fairly robust prediction. The outflow from 
black-hole/neutron-star mergers is less certain, but it is possibly more 
energetic and faster'® (~ 10° erg at 0.5c). 

The interaction of this outflow with the surrounding tenuous matter 
generates a blast wave. Although the outflow may be highly non-uniform 
initially, it becomes spherical rather quickly. We therefore consider a 
spherical outflow with energy $E$ and an initial velocity $c\beta_i$ that propagates 
into a medium with a constant density, $n$. If the outflow is not 
ultrarelativistic, it propagates at a constant velocity until time $t_{\rm dec}$, 
when, at a radius $R_{\rm dec}$, it collects a mass comparable to its own. Time 
$t_{\rm dec}$ (in days) is given by 

$$t_{\rm dec} = \frac{R_{\rm dec}}{c\beta_i} \approx 30\, E_{49}^{1/3}\, n_0^{-1/3}\, \beta_i^{-5/3} \qquad (1)$$

Here and in the following, unless stated otherwise, $q_x$ (where $q$ is any 
parameter) denotes the value of $q/10^x$ in c.g.s. units. At a radius 
$R > R_{\rm dec}$, the flow decelerates, assuming a Sedov–Taylor blast wave: 
$\beta \approx \beta_i (R/R_{\rm dec})^{-3/2}$. 
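
The coasting and deceleration phases described above can be sketched numerically. The snippet below is only an illustration: it assumes that deceleration starts when the swept-up rest mass times $(\beta_i c)^2$ equals $E$, which fixes an order-unity prefactor that the text does not spell out, so the result agrees with the quoted ≈30 days only to within a factor of about 1.5. The function names are hypothetical.

```python
import math

M_P = 1.6726e-24    # proton mass in grams
C = 2.9979e10       # speed of light in cm/s
DAY = 86400.0       # seconds per day


def deceleration_radius_cm(energy_erg, density_cm3, beta_i):
    """Radius at which the swept-up rest mass times (beta_i c)^2 equals E.

    The order-unity prefactor is a choice made for this sketch.
    """
    swept_mass = energy_erg / (beta_i * C) ** 2      # grams
    volume = swept_mass / (density_cm3 * M_P)        # cm^3
    return (3.0 * volume / (4.0 * math.pi)) ** (1.0 / 3.0)


def deceleration_time_days(energy_erg, density_cm3, beta_i):
    """Coasting time t_dec = R_dec / (c beta_i), in days."""
    r_dec = deceleration_radius_cm(energy_erg, density_cm3, beta_i)
    return r_dec / (C * beta_i) / DAY


def beta_at_radius(radius_cm, energy_erg, density_cm3, beta_i):
    """Velocity in units of c: constant up to R_dec, then Sedov-Taylor."""
    r_dec = deceleration_radius_cm(energy_erg, density_cm3, beta_i)
    if radius_cm <= r_dec:
        return beta_i
    return beta_i * (radius_cm / r_dec) ** -1.5


if __name__ == "__main__":
    for beta in (1.0, 0.2):
        t = deceleration_time_days(1e49, 1.0, beta)
        print(f"beta_i = {beta}: t_dec ~ {t:.0f} days")
```

Because $R_{\rm dec} \propto (E/n\beta_i^2)^{1/3}$, the sketch reproduces the $E^{1/3} n^{-1/3} \beta_i^{-5/3}$ scaling of the deceleration time regardless of the prefactor chosen.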

The blast wave generates magnetic fields and accelerates particles 
that emit synchrotron radiation. The same microphysics used successfully 
to model radio emission of type Ibc supernovae, where $\beta \approx 0.2$, 
and to model late radio emission of γ-ray bursts, where the flow is 
mildly relativistic, is applicable here. In both cases, the electrons and the 
magnetic field are found to carry significant fractions of the total 
internal energy of the shocked gas, $\epsilon_e \approx \epsilon_B \approx 0.1$. The observed spectra 
reveal a power-law distribution of the electrons' Lorentz factor, $\gamma$: 
$dN/d\gamma \propto \gamma^{-p}$ for $\gamma > \gamma_m = [(p-2)/(p-1)](m_p/m_e)\epsilon_e\beta^2$, where $m_p$ and $m_e$ 
are the proton and electron masses, respectively, $\gamma_m$ is the minimal 
Lorentz factor of the electron distribution and $p \approx 2$–3. 

The radio spectrum is determined by $\nu_m$, the synchrotron frequency 
of electrons with Lorentz factor $\gamma_m$, and by $\nu_a$, the synchrotron 
self-absorption frequency (Supplementary Information). The specific flux, 
$F_\nu$, at a given frequency is strongly suppressed below $\nu_a$ and it 
decreases as $\nu^{-(p-1)/2}$ above $\nu_m$ and $\nu_a$. The signal across the whole 
spectrum increases at $t < t_{\rm dec}$. Its behaviour after $t_{\rm dec}$ depends on the 
relation between the observed frequency, $\nu_{\rm obs}$, and $\nu_m$ and $\nu_a$. The 
signal peaks at $t_{\rm dec}$ if $\nu_{\rm obs} > \nu_{a,\rm dec}, \nu_{m,\rm dec}$, where $\nu_{a,\rm dec} = \nu_a(t_{\rm dec})$ and 
$\nu_{m,\rm dec} = \nu_m(t_{\rm dec})$. Otherwise the signal peaks when $\nu_{\rm obs} = \nu_m$ or when 
$\nu_{\rm obs} = \nu_a$, whichever is latest. 

The flare characteristics are most sensitive to the initial velocity of 
the outflow, $\beta_i$. Because the brightest radio emission is observed at $t_{\rm dec}$, 
a lower value of $\beta_i$ implies a longer rise time of the radio emission after 
the merger. Additionally, $\nu_{m,\rm dec}$, $\nu_{a,\rm dec}$ and the peak flux at any observed 
frequency depend strongly on $\beta_i$. As mergers are expected to eject an 
outflow over a range of velocities, we discuss separately below the 
observed signature of mildly relativistic ($\beta_i \approx 1$) and subrelativistic 
($\beta_i \approx 0.2$) ejecta. 

A mildly relativistic blast wave with canonical parameters produces 
a synchrotron spectrum with $\nu_{a,\rm dec} \approx \nu_{m,\rm dec} \approx 1$ GHz. The strongest 
signal is then expected at time $t_{\rm dec}$ (a few weeks after the merger) 
and around 1.4 GHz (Supplementary Information): the peak of the 
observed specific flux at $\nu_{\rm obs}$, in units of millijanskys, is 

$$F_{\nu_{\rm obs},{\rm peak}}[\nu_{\rm obs} > \nu_{m,\rm dec}, \nu_{a,\rm dec}] \approx 0.3\, E_{49}\, n_0^{(p+1)/4}\, \epsilon_{B,-1}^{(p+1)/4}\, \epsilon_{e,-1}^{p-1}\, \beta_i^{(5p-7)/2}\, d_{27}^{-2} \left(\frac{\nu_{\rm obs}}{1.4}\right)^{-(p-1)/2} \qquad (2)$$

where $d$ is the distance to the merger (here $\nu_{\rm obs}$ is in GHz). The peak 
flux at lower frequencies (<1 GHz) is significantly lower and it is 
observed at a later time. If the outflow is subrelativistic ($\beta_i \approx 0.1$–0.2), 
then $\nu_{a,\rm dec}, \nu_{m,\rm dec} \lesssim 150$ MHz and equation (2) is applicable also 
in the frequency range of low-frequency radio detectors. The flux 
peaks at $t_{\rm dec}$, which is of the order of years, and it is brighter at 
150 MHz than at 1 GHz by about an order of magnitude. Note that 
over the whole expected range of blast wave parameters, $\nu_a \lesssim 1$ GHz at 
all times and the spectrum above 1 GHz is optically thin during the 
entire evolution. We stress that in radio supernovae, the surrounding 
dense winds lead at early times to an optically thick spectrum at 
$\nu_{\rm obs} > 1$ GHz, and the transition $\nu_a = \nu_{\rm obs}$ determines the time and flux 
at the peak. As discussed below, this different spectral signature 
enables us to distinguish between merger flares and radio supernovae. 
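
To make the link between the peak flux and the detection horizons of Table 1 concrete, the sketch below evaluates an optically thin synchrotron peak-flux scaling and inverts it for distance. The normalisation (0.3 mJy at $10^{27}$ cm and 1.4 GHz) and the exponents are assumptions of this sketch, chosen to be consistent with the standard scaling and with the horizons listed in Table 1; the function and parameter names are hypothetical. It applies only when the observing frequency lies above both $\nu_{m,\rm dec}$ and $\nu_{a,\rm dec}$.

```python
import math

MPC_IN_CM = 3.0857e24  # centimetres per megaparsec


def peak_flux_mjy(d_cm, e49=1.0, n0=1.0, beta_i=1.0, nu_ghz=1.4,
                  p=2.5, eps_e=0.1, eps_b=0.1):
    """Assumed peak-flux scaling (mJy) for nu_obs above nu_m,dec and nu_a,dec."""
    d27 = d_cm / 1e27
    return (0.3 * e49
            * n0 ** ((p + 1) / 4)
            * (eps_b / 0.1) ** ((p + 1) / 4)
            * (eps_e / 0.1) ** (p - 1)
            * beta_i ** ((5 * p - 7) / 2)
            * d27 ** -2
            * (nu_ghz / 1.4) ** (-(p - 1) / 2))


def detection_horizon_mpc(rms_ujy, snr=4.0, **blast_wave):
    """Distance (Mpc) at which the peak flux equals snr times the r.m.s. noise.

    Cosmological corrections are neglected, so the flux scales as d^-2.
    """
    threshold_mjy = snr * rms_ujy * 1e-3
    flux_at_ref = peak_flux_mjy(1e27, **blast_wave)
    return math.sqrt(flux_at_ref / threshold_mjy) * 1e27 / MPC_IN_CM


if __name__ == "__main__":
    # EVLA-like one-hour r.m.s. of 7 microjansky at 1.4 GHz (Table 1).
    print(detection_horizon_mpc(7.0, e49=1.0, n0=1.0, beta_i=1.0))
    print(detection_horizon_mpc(7.0, e49=10.0, n0=1.0, beta_i=0.2, p=2.5))
```

With a four-sigma threshold, the two calls give horizons close to the 1 Gpc and 370 Mpc entries listed for the EVLA, which is the sense in which the assumed scaling is consistent with the table.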

The circum-merger density also strongly affects the flare signature. 
For example, if the surrounding particle density is ~$10^{-3}$ cm$^{-3}$, the 
peak flux from mildly relativistic ejecta decreases to the microjansky 
level at a distance of 300 Mpc, and the timescale increases by a factor of 
ten, to a year. A merger taking place in such a density can be detected 
only up to distances of ~100 Mpc (Table 1). The density is expected to 
vary significantly, from $n \approx 1$ cm$^{-3}$, in galactic disks, to $n \approx 10^{-6}$ cm$^{-3}$, 
for mergers taking place outside their host galaxies. Because all observed 
Galactic neutron-star binaries reside within the Galactic disk, where the 
average density is $n \approx 1$ cm$^{-3}$, a significant fraction of the cosmological 
mergers are expected also to take place in rather dense environments. 
We therefore use $n = 1$ cm$^{-3}$ as the canonical density value. If mergers 
produce short γ-ray bursts, then observations of their afterglows support 
this value (Supplementary Information). 

An intriguing possibility is that compact merger events also eject 
ultrarelativistic jets that produce short γ-ray bursts (SGRBs). It is 
important to examine the relationship between SGRBs and the radio 
flares discussed here, assuming that mergers are producing SGRBs. An 
SGRB beamed towards us will be observed in coincidence with the 
gravitational-wave signal, providing a clear electromagnetic counter- 
part. Even if the SGRB itself is missed, owing to partial sky coverage, 
its afterglow will be easily detectable. However, SGRBs are expected to 
be beamed, and only rarely will one point towards us. A beamed SGRB 
observed off-axis produces, once it has slowed down, a long-lasting 
radio ‘orphan’ afterglow”®, similar in its characteristics to the mildly 
relativistic signal discussed above. However, the total energy (corrected 
for beaming) in the ultrarelativistic jet is at most comparable to—and 
probably lower than—that of the mildly relativistic ejecta. Consequently 
the latter will dominate the radio emission. 

The radio remnant signals that we consider here, which are generated 
by subrelativistic and mildly relativistic outflows, could not have been 
detected in the radio afterglow searches that were carried out following 
SGRB triggers. The reasons are twofold. First, SGRBs are typically 
detected at distances of 1-3 Gpc, far beyond the detection horizon of 
gravitational-wave detectors. Hence the signals are much weaker than 
those associated with detected gravitational-wave events. Second, 
SGRB afterglow searches are optimized to detect the emission from 
ultrarelativistic ejecta pointing towards the observer. Such emission 
peaks at higher frequencies and on shorter timescales than the emission 
from mildly and subrelativistic ejecta discussed here. SGRB afterglow 
searches are done at a sensitivity of ~0.1 mJy at 4.8-8.5 GHz during the 
first week or two after the bursts (see, for example, refs 27-29). Equation 
(2) implies that over the distance range 1-3 Gpc, the radio signal of 
mildly relativistic ejecta with energy $10^{50}$ erg that propagate into a 
medium of density $n = 1$ cm$^{-3}$ peaks after ~60 days at a flux of 
~0.01–0.1 mJy at 5 GHz. The flux before the peak rises as $(t/t_{\rm dec})^3$ 
(Supplementary Information). Therefore, these early radio afterglow 
searches could not have detected the radio signal, even if the mildly 
relativistic ejecta had an energy of $10^{51}$ erg. Thus, the paucity of detected 
SGRB radio afterglows has no direct implication for the nature of the 
remnants we discuss here. 

A new wave of radio detectors is now coming online. The most sensi- 
tive operate at frequencies of 1.4 GHz and higher. Table 1 summarizes 
the relevant properties of these facilities and their detection horizons. The 
best facility for a targeted search, following a detection of a candidate 
gravitational-wave source, is clearly the EVLA. A deep, ~50 μJy, localized 
EVLA search of the 10–100 deg² error box of a gravitational-wave 
trigger can detect mildly relativistic ejecta (with an energy of even 
~$10^{49}$ erg) out to the horizon of advanced gravitational-wave detectors. 
The upcoming lower-frequency LOFAR sensor array will be more effective 
in searches for subrelativistic outflows, whose signals peak at 
LOFAR's frequencies, thus compensating for LOFAR's lower sensitivity. 
LOFAR is also relatively more effective when searching for flares in a 
low-density medium. 

Even before the completion of the advanced gravitational-wave 
detectors, blind searches can identify radio flares from compact binary 
mergers. Identification of radio emission from any merger type (binary 
neutron star or black hole/neutron star) would determine the merger 
rate, which is a parameter of utmost importance for the design and 
operation of advanced detectors. With a merger rate of 300 Gpc$^{-3}$ yr$^{-1}$, 
we expect these facilities to detect ~20 remnants from mildly relativistic 
outflows with an energy of $E = 10^{49}$ erg (and ~1,000 remnants if 
$E = 10^{50}$ erg), in a single whole-sky snapshot. LOFAR may detect a 
dozen transients in a whole-sky survey, even if only a subrelativistic 
outflow with an energy of $10^{50}$ erg is ejected (see Supplementary 
Information for details, as well as for a discussion of ways to distinguish 
these flares from other possible radio transients). 
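
The order of magnitude of such snapshot counts follows from multiplying the merger rate by the volume inside the detection horizon and by the time each flare stays detectable. The sketch below is a rough illustration only: it uses a Euclidean volume, and the 0.5 Gpc horizon and 45-day visibility window are illustrative assumptions rather than values taken from the detailed estimate in the Supplementary Information. The function name is hypothetical.

```python
import math


def snapshot_count(rate_gpc3_yr, horizon_gpc, visible_days):
    """Expected number of flares visible at one instant over the whole sky.

    Euclidean volume, no cosmological corrections; each flare is assumed
    to stay above the detection threshold for `visible_days`.
    """
    volume_gpc3 = 4.0 / 3.0 * math.pi * horizon_gpc ** 3
    return rate_gpc3_yr * volume_gpc3 * visible_days / 365.25


if __name__ == "__main__":
    # 300 Gpc^-3 yr^-1 is the rate quoted in the text; the horizon and the
    # visibility window are assumptions made for this illustration.
    print(round(snapshot_count(300.0, 0.5, 45.0)))
```

Because the count scales as the cube of the horizon, a modest gain in sensitivity or ejecta energy changes the expected number of remnants substantially.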

Remarkably, the observed 5 GHz transient RT 19870422 (ref. 11) 
shows all the expected properties of the radio remnant of a compact 
binary merger. At 1 Gpc distance and with a duration of two months, 
this transient is what we would expect from a mildly relativistic outflow 
with an energy of ~$10^{50}$ erg. The inferred rate of similar transients (ref. 11), 
80–20,000 Gpc$^{-3}$ yr$^{-1}$, is fully consistent with the estimates of compact 
binary mergers. This transient is therefore an excellent candidate 
to be the first observed radio remnant of a merger. Unfortunately, 
we cannot rule out the possibility that this is an especially bright 
radio supernova (ref. 11). We note, however, that this latter interpretation 
requires a supernova brighter by an order of magnitude than any radio 
supernovae previously observed. Simultaneous optical observations or 
multiwavelength radio observations could have easily distinguished 
between the two possibilities. Unfortunately, no such observations are 
available. However, the detection rate implied by this event is very 
high, indicating that similar events could easily be detected by a relatively 
small-scale survey and that their nature should be easily probed. 

Table 1 | Observing radio flares 

| Radio facility | Observing frequency (GHz) | Field of view (deg²) | One-hour r.m.s.* (μJy) | One-hour detection horizon† ($\beta_i \approx 1$, $E_{49} = 1$, $n_0 = 1$) | One-hour detection horizon† ($\beta_i \approx 1$, $E_{49} = 10$, $n_0 = 1$) | One-hour detection horizon† ($\beta_i = 0.2$, $E_{49} = 10$, $n_0 = 1$, $p = 2.5$) | Ten-hour detection horizon‡ ($\beta_i \approx 1$, $E_{49} = 1$, $n_0 = 10^{-3}$, $p = 2$) |
|---|---|---|---|---|---|---|---|
| EVLA | 1.4 | 0.25 | 7 | 1 Gpc | 3.3 Gpc | 370 Mpc | 140 Mpc |
| ASKAP | 1.4 | 30 | 30 | 500 Mpc | 1.6 Gpc | 180 Mpc | 70 Mpc |
| MeerKAT | 1.4 | 1.5 | 35 | 500 Mpc | 1.6 Gpc | 165 Mpc | 65 Mpc |
| Apertif | 1.4 | 8 | 50 | 400 Mpc | 1.25 Gpc | 140 Mpc | 50 Mpc |
| LOFAR | 0.15 | 20 | 1,000 | 35 Mpc | 90 Mpc | 70 Mpc | 20 Mpc |

Shown are properties and detection horizons (neglecting cosmological corrections) for an observation at different radio facilities of blast waves with various values of $\beta_i$, $E_{49}$, $n_0$ and $p$ (in all cases, $\epsilon_e \approx \epsilon_B \approx 0.1$; see 
text for definitions of these symbols). Information on facilities is available as follows: EVLA (http://www.aoc.nrao.edu/evla); ASKAP (http://www.atnf.csiro.au/projects/askap/technology.html); MeerKAT (http://www.ska.ac.za/meerkat); Apertif (http://www.astron.nl/general/apertif/apertif); and LOFAR (http://lofar.org). 

*The root mean squared value of the background noise for one hour of observation. 

†The distance at which the observed peak flux is four times the one-hour r.m.s. 

‡The distance at which the observed peak flux is four times the root mean squared value of the background noise for ten hours of observation. 


Ephrin Bs are essential components of the Reelin pathway to regulate neuronal migration 

Aycan Sentürk, Sylvia Pfennig, Alexander Weiss, Katja Burk & Amparo Acker-Palmer 

Nature 472, 356-360 (2011) 


In this Letter we made errors in representative image choice, includ- 
ing mislabelling of images or choosing an image from the inappro- 
priate genotype. In all cases, choice of images was completely 
independent of the data analysis and so none of the conclusions in 
our original Letter are affected. We apologise for any confusion these 
errors may have caused. 

Figure 1a depicts a Tbr1 staining of the adult mouse cortex for four 
different genotypes. In the process of choosing representative pictures 
that reflect the results of our analysis shown in Fig. 1b, cropped images 
from original pictures were inadvertently mislabelled and used incorrectly. 
We provide below a corrected version of Fig. 1a with new representative 
images for the following genotypes: WT and Reln+/+; 
Efnb3−/−. A new high-magnification picture for WT is also shown in 
the two rightmost panels. Original images for every genotype and addi- 
tional examples are shown in the Supplementary Information of this 
Corrigendum. 

Figure 1c depicts a Brn1 staining of the E17.5 mouse cortex for 
five different genotypes. In the process of figure assembly cropped 
images from original pictures were inadvertently mislabelled and used 
incorrectly. We provide below a corrected Fig. 1c with a new image for 
Reln+/+; Efnb3−/−. In the ephrinB3 compound mice (Reln+/−; 
Efnb3−/−) Brn1+ cells aberrantly accumulate in the lower layers of 
the cortex and do not migrate to the upper layers, resembling the 
Reeler (Reln−/−) phenotype. Original pictures and additional examples 
are shown in the Supplementary Information of this Corrigendum, 
where arrows indicate the distribution of Brn1+ cells. We have also 
included results from a new, reproduced experiment recently performed 
with an additional cohort of animals that shows exactly the 
same results. 

Figure 1 | Corrected Fig. 1a (with two new panels) and corrected Fig. 1c. 

In Fig. 1d, the second panel, labelled ‘Reln‘’* ;Efnb3 should 
instead be labelled ‘Reln*’’. In the Methods summary section 
‘Stimulation of neurons’, “Cortical neurons from E14.5 were grown 
...” should instead read “Cortical neurons from E15.5 were grown....”. 

Further errors in the Supplementary Information of the original 
Letter are described and corrected in the Supplementary Information 
of this Corrigendum. 


Supplementary Information is linked to the online version of the Corrigendum at 
www.nature.com/nature. 




Dynamics of human adipose lipid turnover in health and metabolic disease 

Peter Arner, Samuel Bernard, Mehran Salehpour, Göran Possnert, Jakob Liebl, Peter Steier, Bruce A. Buchholz, 
Mats Eriksson, Erik Arner, Hans Hauner, Thomas Skurk, Mikael Rydén, Keith N. Frayn & Kirsty L. Spalding 


Adipose tissue mass is determined by the storage and removal of 
triglycerides in adipocytes’. Little is known, however, about adipose 
lipid turnover in humans in health and pathology. To study this in 
vivo, here we determined lipid age by measuring ¹⁴C derived from 
above ground nuclear bomb tests in adipocyte lipids. We report 
that during the average ten-year lifespan of human adipocytes, 
triglycerides are renewed six times. Lipid age is independent of 
adipocyte size, is very stable across a wide range of adult ages and 
does not differ between genders. Adipocyte lipid turnover, however, 
is strongly related to conditions with disturbed lipid metabolism. In 
obesity, triglyceride removal rate (lipolysis followed by oxidation) is 
decreased and the amount of triglycerides stored each year is 
increased. In contrast, both lipid removal and storage rates are 
decreased in non-obese patients diagnosed with the most common 
hereditary form of dyslipidaemia, familial combined hyperlipidae- 
mia. Lipid removal rate is positively correlated with the capacity of 
adipocytes to break down triglycerides, as assessed through lipoly- 
sis, and is inversely related to insulin resistance. Our data support 
a mechanism in which adipocyte lipid storage and removal have 
different roles in health and pathology. High storage but low trigly- 
ceride removal promotes fat tissue accumulation and obesity. 
Reduction of both triglyceride storage and removal decreases lipid 
shunting through adipose tissue and thus promotes dyslipidaemia. 
We identify adipocyte lipid turnover as a novel target for prevention 
and treatment of metabolic disease. 

A major function of adipose tissue is to store and release fatty acids, 
which are incorporated into adipocyte triglycerides according to 
whole-body energy demands. Body fat mass is determined by the 
balance between triglyceride storage and removal in adipocytes, by 
either enzymatic hydrolysis (lipolysis) and subsequent fatty acid oxida- 
tion and/or ectopic deposition in non-adipose tissues. Little is known 
about the dynamics of these processes in humans. Although isotope 
tracer methods have been used to estimate lipid turnover in human 
adipose tissue, these studies have been limited to short-term experi- 
mental conditions’ *. To study long-term adipose tissue lipid turnover 
in vivo and across the adult lifespan, we developed a method to retro- 
spectively determine the age of adipocyte triglycerides in humans. 
Triglycerides are the major component of the adipocyte lipid droplet. 
Lipid age was assessed by measuring the ¹⁴C content in the lipid compartment 
of adipocytes from human subcutaneous adipose tissue, the 
major fat depot in humans. ¹⁴C levels in the atmosphere remained 
remarkably stable until above ground nuclear bomb tests between 
approximately 1955 and 1963 caused a significant increase in ¹⁴C 
relative to stable carbon isotope levels (Fig. 1a). After the Limited 
Nuclear Test Ban Treaty was signed in 1963, ¹⁴C levels in the atmosphere 
decreased exponentially. This is not due to radioactive decay 
(the half-life (T½) of ¹⁴C is 5,730 years), but to diffusion of ¹⁴CO₂ out of 
the atmosphere. ¹⁴C in the atmosphere oxidises to form CO₂, which is 
taken up in the biotope by photosynthesis. Because we eat plants, or 
animals that live off plants, the ¹⁴C content in the atmosphere is 
directly mirrored in the human body. 

Radiocarbon dating has been used to study the incorporation of 
atmospheric ¹⁴C into DNA to determine the age of different human 
cell types, including adipocytes. Here, we compared the incorporation 
of ¹⁴C into adipocyte triglycerides with the dynamic changes in 
atmospheric ¹⁴C described earlier. Triglyceride age was determined by 
using a linear lipid replacement model in which the age distribution of 
lipids within an individual was exponentially distributed, corresponding 
to a constant turnover rate (per year). The associated mean age, 
termed lipid age, is the inverse of the turnover rate and reflects the 
irreversible removal of lipids from adipose stores (Supplementary 
Information 1 and Fig. 1 of Supplementary Information 1). 
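
The replacement model described above can be illustrated with a small numerical sketch. It is not the authors' implementation (which is given in their Supplementary Information 1): the atmospheric curve below is a hypothetical stand-in with roughly the right shape, and all function names and numerical settings are assumptions. The idea is that the ¹⁴C content of a lipid pool with constant turnover is the atmospheric record averaged over an exponential age distribution, and the mean age is found by matching that average to the measured value.

```python
import math


def atmospheric_c14(year):
    """Hypothetical bomb curve (fraction modern); illustrative shape only."""
    if year < 1955.0:
        return 1.0
    if year < 1963.0:
        return 1.0 + 0.9 * (year - 1955.0) / 8.0
    return 1.0 + 0.9 * math.exp(-(year - 1963.0) / 16.5)


def expected_lipid_c14(collection_year, lipid_age, n=2000):
    """Mean atmospheric 14C over an exponential age distribution.

    Ages are taken at the quantiles of an exponential distribution with mean
    `lipid_age` (= 1 / turnover rate), at the midpoints of n equal-probability bins.
    """
    total = 0.0
    for i in range(n):
        u = (i + 0.5) / n
        age = -lipid_age * math.log(1.0 - u)
        total += atmospheric_c14(collection_year - age)
    return total / n


def estimate_lipid_age(measured_c14, collection_year, lo=0.1, hi=10.0, tol=1e-6):
    """Bisection for the mean lipid age that reproduces the measured 14C.

    Assumes the expected value rises with lipid age over [lo, hi], which holds
    for this curve when the biopsy is taken long after the 1963 peak.
    """
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if expected_lipid_c14(collection_year, mid) < measured_c14:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    simulated = expected_lipid_c14(2010.0, 1.6)
    age = estimate_lipid_age(simulated, 2010.0)
    print(f"recovered lipid age: {age:.2f} years; turnover rate: {1 / age:.2f} per year")
```

The round trip in the usage example (simulate a measurement for a known lipid age, then recover it) only checks internal consistency; real estimates depend on the measured atmospheric record.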

Earlier studies indicate that triglycerides in adipose tissue form two 
distinct pools with high or low turnover rates, respectively’*"*. Our 
data, obtained from individuals born before, during and after bomb 
testing, do not support the hypothesis of dual large lipid pools with 
different half-lives (Fig. 1b). ¹⁴C data were modelled according to one 
or more pools of lipids with different lipid removal rates (Supplemen- 
tary Information 1). The existence of a very small pool of younger 
lipids cannot be excluded based on data modelling (Supplementary 
Information 1 and Fig. 2 of Supplementary Information 1). According 
to a two-pool model the influence on the turnover rate is proportional 
to the fraction of lipid in the small pool. Triglyceride exchange between 
adipocytes and other small storage pools can affect turnover estimates. 
The two-pools model shows, however, that the non-adipose pool can 
be neglected when it makes up less than 20% of the lipids (Supplemen- 
tary Information 1, Fig. 3). Small pools with high turnover are more 
important for short-term (days or weeks) than long-term (years) 
triglyceride turnover. 

Mean lipid age was 1.6 years (Fig. 1c), which is in the same range as 
in short-term turnover studies*. The distribution of lipid age was 
compared with that of adipocyte age reported previously in a com- 
parable cohort’. The mean age of adipocytes was 9.5 years (Fig. 1d). 
This implies that triglycerides, on average, are replaced six times 
during the lifespan of the adipocyte, enabling a dynamic regulation 
of lipid storage and mobilization over time. 
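
As a quick check of the arithmetic behind this statement, dividing the two mean ages quoted above gives roughly six replacements per adipocyte lifespan:

```latex
\[
\frac{\text{mean adipocyte age}}{\text{mean lipid age}}
  = \frac{9.5\ \text{years}}{1.6\ \text{years}} \approx 5.9 \approx 6
\]
```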

There is a large variation in adipocyte size within and between indi- 
viduals (Supplementary Information 2, Supplementary Table 1)”. 
However, it is unlikely that the rate of triglyceride removal from 
adipocytes is important for these variations, as lipid age was not related 
to adipocyte size when set in relation to the body fat mass (Fig. 2a, b), 
nor was there a difference in lipid age between large and small adipo- 
cytes of the same adipose tissue sample (Fig. 2c, d). These data indicate 
that there is a continuous exchange of lipids between adipocytes within 
the adipose tissue that is not dependent on adipocyte size. Fatty acids 
produced by lipolysis in one adipocyte could, for example, be taken up 
by adjacent adipocytes and incorporated into their triglycerides. These 
processes would not be part of lipid removal as measured here. 

Figure 1 | Atmospheric ¹⁴C over time and its use to determine lipid age and 
adipocyte age. a, Above ground nuclear bomb testing during the period of the 
cold war caused an increase in atmospheric levels of ¹⁴C. These values decreased 
exponentially following implementation of a limited world-wide test ban treaty 
in 1963 (blue curve). Lipid age is determined by measuring ¹⁴C levels in lipids 
(1) and plotting this value against the bomb curve (2) to determine the 
difference between the year corresponding to the atmospheric ¹⁴C 
concentration (3) and the biopsy collection date (dashed line). Atmospheric ¹⁴C 
levels are presented as ¹⁴C/¹²C ratios in units of fraction modern (for a 
definition of ‘modern’ see Supplementary Information 2). b, Lipid age and 
turnover do not change as a function of person age. Lipid age is shown for three 
individuals born in 1940.2, 1959.9 and 1967.9. Lipid age was shown to be the 
same for all individuals, despite markedly different subject ages. Fat biopsies 
were collected from all individuals on the same date (dashed vertical line). The 
solid vertical lines indicate the date of birth. The small dashed lines show the 
¹⁴C lipid value for each individual. c, Distribution of values for lipid age in 
healthy non-obese or obese individuals from cohort 1 (n = 78). d, The 
distribution of values for human adipocyte age (n = 27). Adipocyte age data are 
obtained from a previous publication (see main text). 

Figure 2 | Relationship between adipocyte size and lipid age. a, b, Influence 
of adipocyte cellularity on lipid age. Individuals were assigned a morphometric 
value, which is the measured adipocyte volume for the individual minus the 
average adipocyte volume for all subjects (see 
Supplementary Information 2). This analysis was carried out across the full 
range of body masses. Positive values indicate larger adipocytes than expected 
(fewer but larger adipocytes = hypertrophy). Negative values indicate smaller 
adipocytes than expected (many but smaller adipocytes = hyperplasia). 
a, Individual values compared by linear regression analysis (n = 74). b, Data 
(mean ± standard error) with morphology as a dichotomous variable (n = 36 
for hyperplasia and n = 38 for hypertrophy). An unpaired t-test was used. 
c, d, Isolated subcutaneous adipocytes were fractionated into small (fraction 1) or 
very large (fraction 4) samples (n = 7). Adipocyte volume (c) and adipose lipid 
age (d) were compared. Values are mean ± standard error. A paired t-test was 
used. Data in a and b are from non-obese plus obese individuals in cohort 1 and 
data in c and d are from cohort 2. 

Lipid age and total fat mass data were used to determine the net 
triglyceride storage in adipose tissue (kg year⁻¹) (see Supplementary 
Information 1). The net amount of lipid stored in adipose tissue each 
year is the sum of exogenous fat incorporation and endogenous syn- 
thesis, minus lipid removal. The removal rate represents the hydrolysis 
of triglycerides (lipolysis) followed by the irreversible removal of lipids 
by oxidation. A high lipid age therefore mirrors low removal rates. No 
relationship between lipid storage or removal and person age or gender 
was seen (Supplementary Information 2 and Fig. 1a–d of Supplementary 
Information 2). 
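
A minimal sketch of how lipid age and fat mass combine into a storage rate is given below. It assumes that fat mass is at steady state, so that the amount stored per year balances the amount removed, and that the removal rate constant is the inverse of the mean lipid age; the calculation actually used in the study is described in its Supplementary Information 1 and may differ. The function names and the 20 kg fat mass are hypothetical.

```python
def removal_rate_per_year(lipid_age_years):
    """Turnover (removal) rate constant, the inverse of mean lipid age."""
    return 1.0 / lipid_age_years


def net_storage_kg_per_year(fat_mass_kg, lipid_age_years):
    """Lipid stored per year if storage balances removal (steady fat mass)."""
    return fat_mass_kg * removal_rate_per_year(lipid_age_years)


if __name__ == "__main__":
    # 20 kg of fat is a hypothetical value; 1.6 years is the mean lipid age
    # reported in the text.
    print(net_storage_kg_per_year(20.0, 1.6))
```

Under this assumption a higher lipid age at fixed fat mass means a lower yearly throughput, which matches the statement that a high lipid age mirrors low removal rates.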

Two clinical conditions where altered lipid metabolism is observed 
were investigated—obesity and familial combined hyperlipidaemia 
(FCHL); the latter is the most common hereditary lipid disorder 
(reviewed in ref. 16). It has an unknown aetiology and is a common 
hereditary cause of premature coronary heart disease. Adipocyte lipo- 
lysis is impaired in both conditions due to decreased cyclic AMP- 
dependent signalling, the major lipolytic pathway in adipocytes'”"”. 
Both conditions show a similar metabolic phenotype (mixed dyslipi- 
daemia, elevated apolipoprotein B and insulin resistance)”’. These 
clinical characteristics are confirmed in our study cohort (Supplemen- 
tary Information 2, Supplementary Table 1). FCHL individuals may 
present with a range of body fat levels; however, for our analyses only 
non-obese FCHL patients were selected so as to remove the confound- 
ing factor of obesity from the study. 

Figure 3 | Lipid turnover in subcutaneous fat. a, b, Lipid storage (a) and lipid 
age (b) were determined in 48 non-obese, 30 obese and 13 non-obese FCHL 
subjects. Error bars indicate standard error. Overall effect is P < 0.0001 by 
analysis of variance (ANOVA) in a and b. Results in graphs are from post-hoc 
test. Data are from cohort 1 (see Supplementary Information 2). A linear 
regression analysis was performed on all individuals from cohort 1 having 
insulin resistance measures (n = 82). c, d, HOMA-IR was correlated with lipid 
age (c) and lipid storage (d). The relationship between lipid age and HOMA-IR 
remained significant when BMI, gender or group (non-obese, obese, FCHL) 
were included in the analysis (partial r = 0.41, P = 0.006 with BMI using 
multiple regression analysis and F = 16.6, P = 0.0001 and F = 4.8, P = 0.03 for 
gender or group, respectively, using analysis of covariance (ANCOVA)). 

In obese subjects, the rate of triglyceride storage (Fig. 3a) and mean 
lipid age (Fig. 3b) were markedly increased compared to non-obese 
individuals. Both lipid age (r = 0.38, P = 0.0005) and triglyceride 
storage (r = 0.60, P < 0.0001) correlated with body mass index (BMI) 
when non-obese and obese individuals were pooled together. Similarly, 
in non-obese FCHL individuals lipid age was increased to values 
observed in obesity (Fig. 3b). In contrast to obesity, however, the rate of 
triglyceride storage was markedly decreased compared to non-obese 
individuals (Fig. 3a). Thus, adipocyte triglyceride turnover is not just a 
mere reflection of the fat mass. Our data indicate a model where a 
combination of high storage and low lipid removal rates, as in obesity, 
facilitates triglyceride accumulation within adipose tissue, thereby promoting 
the development and/or maintenance of excess body fat mass. 
Conversely, a low rate of both triglyceride storage and removal, as in 
FCHL, leads to reduced triglyceride turnover and thereby a decreased 
ability of adipocytes to store and release fatty acids, despite a normal 
body fat mass. As discussed in detail elsewhere, low lipid turnover in 
adipose tissue may result in fatty acids being shunted to the liver, which 
drives the synthesis of apolipoprotein B and increases the circulating 
levels of triglycerides. Adipocyte triglyceride turnover may also be 
involved in determining overall insulin effects. Insulin resistance 
(indirectly measured by the HOMA-IR index, see Supplementary 
Information 2) and lipid turnover were assessed in 82 individuals. 
Triglyceride age was strongly related to levels of insulin resistance 
(Fig. 3c), although there was no relationship between triglyceride storage 
and insulin resistance (Fig. 3d). There was no significant interaction 
between groups (lean, obese and non-obese FCHL) as determined by 
analysis of co-variance, indicating that the rate of triglyceride removal 
from adipocytes has an impact on whole-body insulin sensitivity inde- 
pendent of any underlying disorder. Multiple regression analysis 
showed that the relationship between HOMA-IR and lipid removal 
was not influenced by plasma triglycerides (partial r = 0.35; P = 0.007). 

We also examined non-obese and obese individuals separately 
(Supplementary Information 2 and Figure 2a-d of Supplementary 
Information 2). Variations in BMI were significantly related to lipid 
age only among non-obese and to lipid storage only among obese 
individuals. HOMA-IR variations were significantly related to lipid 
storage when obese subjects were removed from the analysis (no rela- 
tionship was found among obese subjects themselves). Thus, varia- 
tions in triglyceride turnover may have a different impact on metabolic 
status in obese versus non-obese populations. Clearly, this assumption 
must be confirmed by investigations in much larger samples. 

Because adipose tissue lipolysis is the first step in lipid removal, we 
investigated the ability of the cyclic AMP system to activate lipolysis in 
vitro in adipocytes isolated from lean and obese individuals and com- 
pared this with in vivo measurements of lipid storage and removal. 
Spontaneous (basal) lipolysis was not related to lipid turnover (Fig. 4a, 
b). However, the stimulated rate of lipolysis was positively correlated 
with triglyceride removal (inversely correlated with lipid age) but was 
not related to the rate of triglyceride uptake (lipid storage). This was 
irrespective of whether lipolysis was induced using a cyclic AMP ana- 
logue (Fig. 4c, d), by activating endogenous adenylate cyclase (using 
forskolin; Supplementary Information 2 and Fig. 3a, b of Supplementary 
Information 2) or by administration of a synthetic β-adrenoceptor-selective 
catecholamine (isoprenaline; Fig. 3c, d of Supplementary 
Information 2). These data indicate that lipolysis determines lipid 
turnover in adipocytes by regulating the rate of triglyceride removal. 
The impact of subsequent fatty acid oxidation could not be examined 
in this study; however, decreased lipid oxidation is frequently observed 
in obesity. As there are regional variations in lipolysis and all our 
studies were performed on one fat depot, no attempts were made to 
extrapolate findings to the whole-body level. 

We are in the midst of a global epidemic of obesity with negative health 
and socio-economic consequences. We propose adipose triglyceride 
turnover as a novel target for the prevention and treatment of excess 
body fat and possibly its consequences for insulin resistance. New 
insights into abnormal triglyceride turnover in FCHL patients may 
also suggest novel treatment strategies for this complex disease that 
target adipocytes. 


METHODS SUMMARY 

Subjects. Subcutaneous adipose tissue was obtained from two patient cohorts. 
Patient selection and collection of clinical data are described in Supplementary 
Information 2. 

Preparation of lipids. Triglycerides were extracted from pieces of adipose tissue 
or isolated adipocytes. Details of lipid extraction and adipocyte isolation are given 
in Supplementary Information 2. Extracted lipids were subjected to accelerator 
mass spectrometry analysis, as described in Supplementary Information 2. 

Data analysis. Calculations between lipid turnover and clinical or adipocyte phe- 
notypes are described in Supplementary Information 2. Calculations of lipid age and 
net lipid uptake by adipose tissue are described in Supplementary Information 1. 
Conventional statistical methods were used to summarize and compare data. 

An olfactory receptor for food-derived odours 
promotes male courtship in Drosophila 


Yael Grosjean, Raphael Rytz, Jean-Pierre Farine, Liliane Abuin, Jerome Cortot, Gregory S. X. E. Jefferis & Richard Benton 


Many animals attract mating partners through the release of 
volatile sex pheromones, which can convey information on the 
species, gender and receptivity of the sender to induce innate 
courtship and mating behaviours by the receiver. Male Drosophila 
melanogaster fruitflies display stereotyped reproductive behaviours 
towards females, and these behaviours are controlled by the neural 
circuitry expressing male-specific isoforms of the transcription 
factor Fruitless (FRU^M). However, the volatile pheromone ligands, 
receptors and olfactory sensory neurons (OSNs) that promote male 
courtship have not been identified in this important model organism. 
Here we describe a novel courtship function of Ionotropic receptor 
84a (IR84a), which is a member of the chemosensory ionotropic 
glutamate receptor family, in a previously uncharacterized 
population of FRU^M-positive OSNs. IR84a-expressing neurons are 
activated not by fly-derived chemicals but by the aromatic odours 
phenylacetic acid and phenylacetaldehyde, which are widely found in 
fruit and other plant tissues that serve as food sources and 
oviposition sites for drosophilid flies. Mutation of Ir84a abolishes 
both odour-evoked and spontaneous electrophysiological activity in 
these neurons and markedly reduces male courtship behaviour. 
Conversely, male courtship is increased, in an IR84a-dependent 
manner, in the presence of phenylacetic acid but not in the presence 
of another fruit odour that does not activate IR84a. Interneurons 
downstream of IR84a-expressing OSNs innervate a pheromone-processing 
centre in the brain. Whereas IR84a orthologues and 
phenylacetic-acid-responsive neurons are present in diverse 
drosophilid species, IR84a is absent from insects that rely on 
long-range sex pheromones. Our results suggest a model in which 
IR84a couples food presence to the activation of the fru^M courtship 
circuitry in fruitflies. These findings reveal an unusual but 
effective evolutionary solution to coordinate feeding and 
oviposition site selection with reproductive behaviours through a 
specific sensory pathway. 

While mapping the projections of Ionotropic receptor (IR)- 
expressing OSNs to the primary olfactory centre, the antennal lobe, 
we observed that an Ir84a reporter was labelling neurons innervating 
the VL2a glomerulus. VL2a is one of only three glomeruli that are 
larger in males and whose OSN inputs and projection neuron outputs 
express male-specific isoforms of the behavioural sex determination 
gene fruitless (fru^M). fru^M-expressing OSNs have been implicated in 
promoting male sexual behaviours, because inhibition of synaptic 
transmission in all of these neurons simultaneously reduces male 
courtship of females. We confirmed the expression of Ir84a in 
fru^M-expressing neurons by visualizing the co-expression of an Ir84a 
reporter, as well as endogenous Ir84a transcripts, with a fru^M 
reporter (Fig. 1a and Supplementary Fig. 1a). No sexual dimorphism 
was observed either in the number of Ir84a-expressing cells (males, 
30 ± 2 neurons; females, 30 ± 3 neurons; n = 4 antennae each) or in 
their targeting to VL2a, indicating that FRU^M does not have an 
essential role in the development of these neurons, similar to other 
fru^M-expressing OSNs. 


We generated a GAL4 knock-in null allele, Ir84a^GAL4 (Fig. 1b, c). 
Ir84a^GAL4 heterozygotes expressed a GAL4-responsive, membrane- 
targeted, green fluorescent protein (GFP) transgene (UAS-mCD8:GFP) 
exclusively in Ir84a-expressing OSNs (Fig. 1d, Ir84a+/−, and 
Supplementary Fig. 1b). In Ir84a^GAL4 homozygotes, the endogenous 
expression of Ir84a was lost, but the distribution and dendritic 
projections of these neurons, as revealed by mCD8:GFP, were unaffected 
(Fig. 1d, Ir84a−/−). The axons of Ir84a-expressing neurons in 
heterozygous and homozygous Ir84a^GAL4 flies projected only to VL2a 
(Fig. 1e). Ir84a is therefore dispensable for the specification and wiring 
of the neurons in which it is expressed. An amino-terminal enhanced 
GFP (EGFP)-tagged version of this receptor localized to the cell bodies 
and the ciliated dendritic endings of these neurons but not to their axon 
termini (Fig. 1f, g), consistent with an exclusive role for IR84a as an 
olfactory receptor in the fru^M circuitry. 

We tested the responses of IR84a-expressing neurons to chemicals 
produced by male or virgin female flies, both by delivering headspaces 
of flies from a distance (simulating the action of volatile pheromones) 
and by presenting extracts from fly cuticles at close range (mimicking 
exposure to non-volatile hydrocarbons, such as contact pheromones). 
These stimuli produced no or extremely small responses, as detected by 
extracellular recordings in ac4 sensilla (Supplementary Fig. 2a), which 
belong to the class of olfactory hair that houses IR84a-expressing 
neurons, as well as OSNs that express IR75d, or IR76a and IR76b. 
These observations suggest that IR84a is not tuned to fly-derived 
pheromones. We therefore tested 163 structurally diverse odours. 
Only three of these gave responses of >50 spikes s^-1 above basal 
activity: phenylacetaldehyde (as identified previously), phenylacetic 
acid and phenylethylamine (Fig. 2a and Supplementary Fig. 2b). Dose 
response curves revealed that sensitivity to these ligands is similar in 
both sexes (Fig. 2b and data not shown). 

In Ir84a^GAL4 homozygous mutants, the responses to phenylacetic 
acid and phenylacetaldehyde were completely abolished (Fig. 2c, d, 
Ir84a−/−). Re-introduction of Ir84a function in these neurons, by 
using UAS-Ir84a or UAS-EGFP:Ir84a cDNA transgenes, rescued these 
phenotypes (Fig. 2c, d, rescue), indicating a cell-autonomous function 
for IR84a in mediating these odour responses. By contrast, responses 
to phenylethylamine were unaffected (Fig. 2c, d), corroborating the 
evidence that this chemical is detected by the neurons that express both 
IR76a and IR76b. Consistent with these loss-of-function data, 
misexpression of IR84a in Odorant receptor 35a (OR35a)-expressing 
neurons was sufficient to confer responsiveness to phenylacetic acid and 
phenylacetaldehyde (Supplementary Fig. 2c). The basal activity in 
Ir84a mutant ac4 sensilla was also lower than that in the ac4 sensilla 
of wild-type and rescue genotypes (Fig. 2e, f), indicating that IR84a has 
a role in promoting spontaneous firing. 

Phenylacetic acid and phenylacetaldehyde are aromatic compounds 
found in a diverse range of fruit and other plant tissues, as well as in 
their fermentation products, and they are used in human perfumes for 
their floral, honey-like, sweet smell. We confirmed the presence of these 
chemicals in two host fruit for drosophilid flies, overripe bananas and 
the prickly-pear cactus Opuntia ficus-indica, as well as in laboratory 
Drosophila medium, by using gas chromatography–mass spectrometry 
analysis (Supplementary Table 1). The ubiquity of phenylacetic acid in 
vegetal tissues may be linked with its activity as a growth-regulating 
auxin and/or its production by plant-associated microorganisms. 
Small, but reproducible, quantities of phenylacetic acid and 
phenylacetaldehyde were also detected in whole-body cuticular extracts 
of male and virgin female D. melanogaster (Supplementary Table 1). The 
similarity in the relative amounts of these chemicals in laboratory 
medium and fruitfly extracts (Supplementary Table 1) suggested that 
these chemicals are transferred from food to flies during their culture. 
'Clean' cuticular extracts from animals grown on a minimal medium 
containing only sucrose and agarose consistently contain no detectable 
phenylacetaldehyde or phenylacetic acid (Supplementary Table 1). 

Figure 1 | Gene targeting of Ir84a, a candidate olfactory receptor in the fru^M 
circuit. a, Immunofluorescence using anti-GFP (green), anti-CD8 (magenta) 
and nc82 (blue; neuropil marker) antibodies on an antennal lobe of an animal 
in which an Ir84a promoter-lexA transgene drives (>) expression of a LexA- 
responsive CD2:GFP reporter transgene and in which the fru GAL4 insertion 
allele drives expression of a GAL4-responsive CD8 reporter transgene 
(genotype, Ir84a-lexA/UAS-mCD8;lexAop-rCD2:GFP/fru^GAL4). b, Schematic of 
the Ir84a locus, illustrating the organization of wild-type Ir84a and null mutant 
Ir84a^GAL4 alleles. The span of the cloned Ir84a promoter is also indicated. P 
denotes PCR primer position. kb, kilobases. c, PCR confirmation of the 
Ir84a^GAL4 allele using the pairs of primers shown in b. d, Combined Ir84a RNA 
in situ hybridization (magenta) and anti-GFP immunofluorescence (green) on 
antennae from Ir84a mutant flies: top, heterozygous mutants 
(UAS-mCD8:GFP/+;Ir84a^GAL4/+); and, bottom, homozygous mutants 
(UAS-mCD8:GFP/+;Ir84a^GAL4/Ir84a^GAL4). Separate channels are shown on the right. 
e, Immunofluorescence using anti-GFP (green) and nc82 (magenta) antibodies 
on antennal lobes from Ir84a mutant flies: top, control heterozygous mutants 
(UAS-mCD8:GFP;Ir84a^GAL4/+); and, bottom, homozygous mutants 
(UAS-mCD8:GFP;Ir84a^GAL4/Ir84a^GAL4). f, Immunofluorescence using anti-GFP 
antibodies (green) and antibodies specific for the sensory cilium base marker 
21A6 (magenta) on an antenna expressing EGFP:IR84a 
(UAS-EGFP:Ir84a/+;Ir84a^GAL4/Ir84a^GAL4). g, Immunofluorescence using anti-GFP 
(green) and nc82 (magenta) antibodies on an antennal lobe of a fly expressing 
EGFP:IR84a (UAS-EGFP:Ir84a;Ir84a^GAL4/Ir84a^GAL4). a, d-g, Scale bar, 10 µm. 

The expression of IR84a in fru^M-expressing neurons implicates this 
receptor in the regulation of male courtship. Indeed, in single-pair 
courtship assays, we observed that Ir84a^GAL4 mutant males court wild- 
type females significantly less than do wild-type males (Fig. 3a and 
Supplementary Fig. 3a, b). This phenotype was observed using both 
decapitated virgin females (which do not produce feedback signals) 
(Fig. 3a) and in more natural conditions, with intact females together 
with food (Supplementary Fig. 3b). Most individual components of the 
courtship ritual were affected in Ir84a mutant flies (Supplementary 
Fig. 3a). These defects were rescued with a UAS-Ir84a transgene, 
confirming that they result from the absence of IR84a in OSNs (Fig. 3a and 
Supplementary Fig. 3a, b). The observed reduction in male heterosexual 
courtship index (~50%) is highly comparable to the phenotype 
of flies in which all FRU^M-positive OSNs are silenced, suggesting that 
IR84a-expressing neurons are the major olfactory fru^M channel 
contributing to this behaviour. Residual courtship is presumably 
stimulated by other sensory modalities, such as taste. Male wild-type D. 
melanogaster also show a low level of courtship towards other males, 
and we found that this homosexual courtship was also markedly 
reduced in Ir84a^GAL4 mutants (Fig. 3b). By contrast, Ir84a^GAL4 mutant 
females did not show overt defects in reproductive behaviours, including 
copulation latency, success or duration (Fig. 3c). 

In innate olfactory preference assays, Ir84a^GAL4 mutant flies still 
show robust avoidance of acetic acid, indicating that they do not have 
a general impairment in sensory detection (Supplementary Fig. 3c). By 
contrast, we did not observe any obvious responses of flies to 
phenylacetic acid (Supplementary Fig. 3d), suggesting that this food-derived 
odour is not a volatile stimulus that attracts flies but is a salient cue at 
close range. Notably, phenylacetic acid has a low vapour pressure 
(~0.005 mm Hg at 25 °C) compared with other fruit volatiles (for 
example, ethyl butyrate, which has a vapour pressure of 15 mm Hg 
at 25 °C). The observation that courtship is reduced in Ir84a^GAL4 
mutants in assays in which only small amounts of phenylacetic acid 
are present on fly cuticles (Supplementary Table 1) raises the possibility 
that spontaneous activity of these neurons also contributes to 
establishing a basal courtship level, which is abolished in the absence of IR84a 
(Fig. 2e, f). 

To test whether IR84a ligands are sufficient to promote courtship, 
we adapted our assay by using killed female objects (which males court 
at only low levels) and by replacing the base of the chamber with gauze, 
beneath which we placed a filter paper treated with odour or solvent. 
Perfuming with phenylacetic acid nearly doubled the courtship index 
of wild-type flies compared with a solvent control (Fig. 3d). This effect 
was abolished in Ir84a^GAL4 mutants and could be restored, albeit not 
fully, by introducing a UAS-Ir84a transgene. By contrast, ethyl butyrate, 
which does not activate IR84a (Supplementary Fig. 2b), did not increase 
courtship (Fig. 3d). We also perfumed the courtship chamber with 
Drosophila food—which contains phenylacetic acid (Supplementary 
Table 1)—and observed that this complex olfactory stimulus induced 
IR84a-dependent increases in male courtship behaviour (Fig. 3e). 

The other fru^M-expressing OSN populations express either OR67d, 
which is a receptor for the antiaphrodisiac male pheromone 
cis-vaccenyl acetate, or OR47b, which is activated by unidentified 
fly-derived odours from both sexes and may participate in mate 
localization. We examined how IR84a sensory information is 
integrated with these pheromonal pathways, by visualizing the axons of 
projection neurons innervating the VL2a (IR84a), VA1lm (OR47b) 
and DA1 (OR67d) glomeruli, which carry sensory information to the 
mushroom body and lateral horn (Fig. 4a). We registered images of 
single-labelled projection neurons of different glomerular classes onto 
a common reference brain (see Methods). DA1 and VA1lm excitatory 
projection neurons target an anterior-ventral pheromone-processing 
region of the lateral horn, which is segregated from projection 
neurons that are responsive to general food odours. Importantly, we 
found that VL2a projection neurons (and no other IR-expressing 
projection neuron class) are highly interdigitated with pheromone 
pathways and not food pathways (Fig. 4a, b). Indeed, VL2a projection 
neuron axon terminals overlap more strongly with VA1lm projection 
neurons than with any of the other 44 projection neuron classes (see 
Methods and Supplementary Fig. 4b, c), consistent with projection 
neurons of both of these classes transmitting courtship-promoting 
sensory signals. The VL2a, DA1 and VA1lm inhibitory projection 
neurons were observed to overlap to a similar extent (Supplementary 
Fig. 4a–c). The anatomical convergence of combinations of excitatory 
and inhibitory inputs from VL2a, VA1lm and DA1 projection neurons 
may allow the integration of olfactory signals by fru^M-expressing 
third-order neurons to control male courtship behaviour. 

Figure 2 | Essential requirement for IR84a for spontaneous and odour- 
evoked neuronal responses. a, An odour ligand screen in ac4 sensilla. The 
mean evoked responses of ac4 sensilla neurons to a panel of 163 odours 
(representing the summed activities of the IR84a-expressing, IR75d-expressing, 
and IR76a- plus IR76b-expressing neurons). Data are presented as 
mean ± s.e.m.; n = 4, mixed genders. An enlarged version of this histogram 
with all of the odour names indicated is provided in Supplementary Fig. 2b. The 
structures of the best three agonists are shown. b, The dose response curve to 
phenylacetic acid of male and female ac4 sensilla. Data are presented as 
mean ± s.e.m.; n = 12 for each gender. The doses reflect the total mass of 
phenylacetic acid (diluted in solvent) in the odour syringe. The responses of the 
male and female sensilla are not significantly different at any concentration 
(analysis of variance (ANOVA), P > 0.0571). A higher concentration of 
phenylacetic acid could not be prepared because of its limited solubility. 
c, Representative traces of recordings of ac4 sensilla stimulated with the 
indicated odour stimuli (10 µl phenylacetic acid (10 µg µl^-1) or 10 µl (1% (v/v)) 
other odours) in wild-type (w^1118), Ir84a−/− mutant (Ir84a^GAL4/Ir84a^GAL4) and 
IR84a rescue (UAS-Ir84a/+;Ir84a^GAL4/Ir84a^GAL4) flies. Paraffin oil and water 
are solvent controls. The grey bars above the traces mark the stimulus time (1 s). 
d, Quantification of mean responses to the indicated odours for the experiment 
shown in c. Data are presented as mean ± s.e.m.; n = 11, mixed genders. The 
restoration of phenylacetaldehyde and phenylacetic acid responses by the 
expression of EGFP:IR84a (UAS-EGFP:Ir84a/+;Ir84a^GAL4/Ir84a^GAL4) is also 
shown (green). For each odour stimulus, bars labelled with different letters are 
significantly different from each other (phenylethylamine ANOVA, 
P = 0.4861; phenylacetaldehyde ANOVA, P = 0.0003; phenylacetic acid 
ANOVA, P < 0.0001). For phenylacetic acid, EGFP:IR84a rescue is partial. 
e, Representative traces of spontaneous activity in ac4 sensilla in wild-type, 
Ir84a−/− mutant (Ir84a^GAL4/Ir84a^GAL4) and IR84a rescue 
(UAS-Ir84a/+;Ir84a^GAL4/Ir84a^GAL4) flies. The arrows mark two distinct spike 
amplitudes discernible in the sparser spike trains in Ir84a mutant sensilla; these 
are likely to correspond to the IR75d- and the IR76a- plus IR76b-expressing neurons. 
f, Quantification of the mean spontaneous activity in ac4 sensilla in the 
experiment shown in e. Data are presented as mean ± s.e.m.; n = 12, mixed 
genders. Bars labelled with different letters are significantly different (ANOVA, 
P = 0.0005). 

Many olfactory IRs are conserved in insects and may detect 
odours that are important for all species. By contrast, although 
IR84a orthologues are present in ecologically diverse drosophilids, they 
are absent from other Diptera and more divergent insects (Supplemen- 
tary Fig. 5a). In the cactophilic species Drosophila mojavensis, we 
identified coeloconic sensilla with neurons that are responsive to phe- 
nylacetic acid and phenylacetaldehyde on their anterior antennal sur- 
face (similar to ac4 sensilla in D. melanogaster) (Supplementary Fig. 5b). 
Thus, IR84a may have a conserved, drosophilid-specific function. 

Despite the widely held assumption of the existence of volatile 
chemicals that promote courtship in Drosophila, behavioural evidence 
for long-range pheromones is inconclusive, and no female-specific 
volatile compound that activates male OSNs has been identified. Our 
characterization of IR84a identifies an olfactory receptor that is 
expressed in FRU^M-positive neurons and is required to promote male 
courtship. Surprisingly, this receptor is not activated by fly-derived 
odours but rather by aromatic compounds that are present in the 
vegetal substrates in which fruitflies feed, breed and oviposit. Thus, 
the IR84a pathway may promote male courtship in the presence of 
food, complementing the functions of pheromone receptors in 
regulating mate choice (Supplementary Fig. 5c). This model can account for 
the widespread observations that D. melanogaster and other drosophilids 
mate predominantly on their food substrates. Whereas many insects 
and other animal classes use long-range sex pheromones to attract 
potential mates, the evolution of IR84a in fruitflies has provided an 
alternative (although not necessarily exclusive) olfactory mechanism to 
unite males with females by integrating food-sensing neurons with the 
circuitry controlling sexual behaviour. Whether other animals have 
dedicated sensory pathways for environmental 'aphrodisiacs' remains 
an open question. 

Figure 3 | IR84a is required for male courtship behaviour. a, Courtship 
indices for wild-type (w^1118), Ir84a−/− mutant (Ir84a^GAL4/Ir84a^GAL4) and 
IR84a rescue (rescue (1×), UAS-Ir84a/+;Ir84a^GAL4/Ir84a^GAL4, or rescue (2×), 
UAS-Ir84a/UAS-Ir84a;Ir84a^GAL4/Ir84a^GAL4) male flies paired with wild-type 
decapitated virgin females. Data are presented as mean ± s.e.m.; n is shown in 
the corresponding bar in this and other histograms. b, Courtship indices for 
male flies of the indicated genotypes (as in a) paired with wild-type decapitated 
males. Data are presented as mean ± s.e.m. c, Receptivity of female flies of the 
indicated genotypes to wild-type males as measured by percentage copulation 
success (left), copulation latency (centre) and copulation duration (right). Data 
are presented as mean ± s.e.m. d, e, Courtship indices for male flies paired with 
freeze-killed wild-type virgin females in an assay chamber perfumed from 
beneath a mesh (to avoid direct contact with the flies) with solvent (paraffin oil), 
phenylacetic acid or ethyl butyrate (d) or nothing or fly food (e). Data are 
presented as mean ± s.e.m. Statistical analysis was performed as follows. a, b, A 
one-way ANOVA (Kruskal-Wallis, P < 0.0001) followed by Dunn's multiple 
comparison post hoc test, in which all genotypes are individually compared 
with the control, was performed on all data. ***, P < 0.001. c, Values for 
percentage female copulation success were compared using a chi-squared 
statistical test. For copulation latency and duration, a one-way ANOVA 
(Kruskal-Wallis) followed by Dunn's multiple comparison post hoc test were 
performed; no significant differences between genotypes were found. d, e, A 
two-way ANOVA, followed by a Bonferroni post hoc test, was performed on all 
data, except for ethyl butyrate: P < 0.0001 for the odour stimulus, and 
P < 0.0001 for the genotype as source of variation; P < 0.0001 for the 
interaction between the two sources of variation, confirming a link between 
odour and genotype. **, P < 0.01; ***, P < 0.001. Ethyl butyrate was compared 
to paraffin oil using a Mann-Whitney U test and yielded no statistical 
difference for control males. Although the rescue genotype displays a 
significant difference between phenylacetic acid (or food) and control 
perfuming treatments, its courtship indices are lower than those of the control 
genotype; this may result from lower levels of IR84a expression in the rescue 
animals or from indirect effects such as genetic background. 

Figure 4 | Anatomical integration of VL2a (IR84a-expressing) projection 
neurons in the pheromone processing centre. a, Top, three-dimensional 
rendering of registered axonal projections of projection neurons (PNs) 
receiving input from the VL2a glomerulus (IR84a-expressing) (dark blue), the 
VA1lm glomerulus (OR47b-expressing) (green) and the DA1 glomerulus 
(OR67d-expressing) (red) (n = 4 for each class). Other classes of projection 
neuron are shown in cyan (nine IRs) and grey (27 ORs) (n = 1 for each class). 
These projection neurons are cholinergic and project to both the mushroom 
body and the lateral horn through the inner antennocerebral tract (iACT); a 
small minority of other projection neurons are GABAergic (γ-aminobutyric 
acid-containing) and project exclusively to the lateral horn through the middle 
ACT (Supplementary Fig. 4a). Bottom, when the other projection neurons are 
removed, the extensive co-mingling of VL2a and VA1lm projection neurons is 
evident. Scale bar, 25 µm. D, dorsal; L, lateral; M, medial; V, ventral. b, Cluster 
analysis of normalized axon overlap scores (y axis) using Ward's method (see 
Methods) demonstrates that VL2a projection neurons cluster with projection 
neurons from other fru^M-expressing glomeruli rather than with other IR- 
expressing projection neurons. Colours are as in a. 


METHODS SUMMARY 


Genetic and molecular biological manipulations and histological and 
electrophysiological analyses were performed essentially as described previously. 
Responses to Drosophila-derived stimuli were measured by adapting previous 
protocols. Drosophila extracts and food or fruit extracts were made using 
CH2Cl2 solvent and were analysed by gas chromatography–mass spectrometry 
on a GC-MS-QP2010 apparatus (Shimadzu). Courtship tests were performed at 
25 °C in round observation chambers (1.2 cm in diameter and 0.4 cm deep) under 
far-red light and were video recorded and annotated manually by observers who 
were blinded to the genotypes. For perfumed behavioural assays, the courtship 
chamber was placed on top of an identical chamber, with the chambers separated 
by muslin gauze to avoid contact between the flies in the top chamber and the 
chemical stimuli placed in the bottom chamber. Olfactory responses were 
measured using a Y-maze assay. Projection neurons from several sources (see 
Methods) were traced, where necessary, using Amira software 
(http://www.amira.com) and registered, by adapting previous approaches. The 
degree of overlap between the axon terminals of different projection neurons was 
measured using the three-dimensional convex hull formed by each set of 
terminals. Cluster analysis of these data was performed using Ward's method. 
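
As a rough illustration of the cluster analysis mentioned above, the following Python sketch applies Ward's method to a small table of overlap scores. It is not the authors' pipeline: the input layout (one row of normalized overlap scores per projection neuron class), the function name `ward_merges` and the toy numbers in the usage lines are assumptions made purely for illustration, and the convex-hull overlap measurement itself is not reproduced here.

```python
from itertools import combinations


def ward_merges(profiles):
    """Cluster the rows of `profiles` with Ward's method.

    Returns the merges in the order they happen, each as a pair of
    tuples holding the row indices of the two clusters that were joined.
    """
    members = {i: (i,) for i in range(len(profiles))}
    # Squared Euclidean distance between every pair of rows.
    dist = {
        (i, j): sum((a - b) ** 2 for a, b in zip(profiles[i], profiles[j]))
        for i, j in combinations(range(len(profiles)), 2)
    }
    merges = []
    next_id = len(profiles)
    while len(members) > 1:
        i, j = min(dist, key=dist.get)  # closest pair of clusters
        d_ij = dist[(i, j)]
        n_i, n_j = len(members[i]), len(members[j])
        # Lance-Williams update of the distances for Ward's criterion.
        updated = {}
        for k in members:
            if k in (i, j):
                continue
            n_k = len(members[k])
            d_ki = dist[(min(k, i), max(k, i))]
            d_kj = dist[(min(k, j), max(k, j))]
            updated[k] = (
                (n_i + n_k) * d_ki + (n_j + n_k) * d_kj - n_k * d_ij
            ) / (n_i + n_j + n_k)
        merges.append((members[i], members[j]))
        # Drop distances involving the merged clusters, then add the new one.
        dist = {p: d for p, d in dist.items() if i not in p and j not in p}
        members[next_id] = members.pop(i) + members.pop(j)
        for k, d in updated.items():
            dist[(k, next_id)] = d  # next_id is always the larger key
        next_id += 1
    return merges


# Toy overlap profiles (made-up numbers): rows 0 and 1 resemble each
# other, as do rows 2 and 3.
toy_profiles = [(0.9, 0.8, 0.1), (0.8, 0.9, 0.2), (0.1, 0.2, 0.9), (0.2, 0.1, 0.8)]
for left, right in ward_merges(toy_profiles):
    print(left, "+", right)
```

Rows with similar overlap profiles are joined first, which is the sense in which projection neuron classes "cluster" together in a dendrogram built this way.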


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 

Drosophila strains. Drosophila stocks were maintained on a standard corn flour, 
yeast and agar medium under a 12 h light and 12 h dark cycle at 25 °C. We used the 
following mutant and transgenic strains: Or35a-GAL4 (ref. 13), Ir84a-GAL4 (ref. 9), 
UAS-Ir84a, UAS-EGFP:Ir84a, UAS-mCD8:GFP, fru^GAL4 (ref. 5), fruP1-GAL4 
(ref. 4), lexAop-rCD2::GFP, UAS-mCD8 (ref. 30), UAS-DTI, 70FLP,70I-SceI/CyO, 
70FLP and 70I-CreI. For behavioural analysis, all flies carrying mutant and 
transgenic chromosomes were backcrossed to an isogenized w^1118 control line for five 
generations. The sequenced wild-type D. melanogaster strain (Bloomington 
Drosophila Stock Center number 2057) was used for the ac4 odour screen, and 
Canton-S was used as the courtship object in behavioural experiments. Wild-type 
D. mojavensis was obtained from the Drosophila Species Stock Center (University of 
California, San Diego; stock number 15081-1352.22). 

Molecular biology. New transgenes were constructed as follows, using standard 
methods. For the Ir84a GAL4 targeting construct, 5,087 base pairs (bp) genomic 
DNA immediately upstream of the Ir84a start codon was amplified and subcloned 
as 1,924-bp and 3,163-bp fragments either side of the I-SceI site upstream of GAL4 
in the pED36 targeting vector. Similarly, a 964-bp genomic fragment immediately 
downstream of the Ir84a stop codon was cloned downstream of GAL4. For the 
Ir84a promoter-lexA construct, the same 1,964 bp Ir84a promoter fragment used 
in the Ir84a-GAL4 transgene was cloned upstream of lexA:VP16-SV40 (ref. 30) in 
the vector pattB. 

Gene targeting. Targeted replacement of Ir84a by GAL4 was performed 
essentially as described previously. A transgenic line containing the Ir84a GAL4 
targeting construct on the X chromosome was used as a donor for targeting crosses 
in which heat-shock-inducible transgenic FLP and I-SceI enzymes were used to 
excise and linearize the targeting fragment. Reinsertion was detected by the 
presence of a FLP-insensitive white+ marker. From ~193,000 flies screened, five 
targeted insertions were recovered and verified by PCR as duplicates of the 
Ir84a locus. One duplicate was resolved with I-CreI, selecting for loss of white+. 
From 218 white flies, 1 was found to contain the desired precise Ir84a to GAL4 
replacement. 

Histology. Immunofluorescence on antennal cryosections was performed as 
described previously. Immunofluorescence on whole mount brains, and 
combined fluorescent RNA in situ hybridization and immunofluorescence on antennal 
sections, was performed as described previously. The primary antibodies used 
were mouse monoclonal nc82 (1:10 dilution; provided by A. Hofbauer), mouse 
monoclonal 21A6 (1:200; Developmental Studies Hybridoma Bank), rabbit anti- 
GFP (1:1,000; Invitrogen), mouse anti-GFP (1:500; Invitrogen), rat anti-mCD8 
(1:50; Caltag), anti-DIG-POD (1:500; Roche). The secondary antibodies used were 
Alexa Fluor 488-, Cy3- and Alexa Fluor 647-conjugated goat anti-mouse, anti- 
rabbit or anti-rat IgG (Molecular Probes and Jackson ImmunoResearch). These 
were diluted to 1:100 and 1:1,000 for whole mount brains and antennal sections, 
respectively. All microscopy was performed using an LSM 510 laser scanning 
confocal microscope (Zeiss). 

Electrophysiology. Extracellular recordings in single sensilla of 1-8-day-old flies 
were performed and quantified as described previously. Odours were diluted to 
1% (v/v) unless otherwise noted. The CAS numbers, solvents, sources and purities 
for the odorants used are provided in Supplementary Table 2. Corrected responses 
were quantified by counting all spikes in ac4 sensilla in a 0.5 s window from 
150–200 ms after the stimulus trigger. This delay period was defined precisely for each 
recording session with a control odour response and mainly represents the time 
for the odour to reach the preparation. From this spike value, we subtracted the 
number of spontaneous spikes in a 0.5 s window before stimulation and doubled 
the result to obtain spikes s^-1. Spontaneous activity was quantified by counting the 
spikes in a 5 s window without stimulus and then dividing by five to obtain spikes 
s^-1. After verifying that the responses were normally distributed, we compared all 
genotypes for a given experiment by analysis of variance (ANOVA), with the 
genotype as the main effect, and adjusted the alpha level for planned post hoc 
means comparisons. 
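
The spike quantification described above amounts to simple window arithmetic, which the following Python sketch makes explicit. It is only an illustration under stated assumptions: spike times are taken to be a list of timestamps in seconds, and the function names, the default 0.15 s delay and the example numbers are hypothetical rather than taken from the recording software that was used.

```python
def count_spikes(spike_times, start, stop):
    """Number of spikes with start <= t < stop (times in seconds)."""
    return sum(start <= t < stop for t in spike_times)


def corrected_response(spike_times, trigger, delay=0.15, window=0.5):
    """Odour-evoked response in spikes per second.

    Spikes are counted in `window` seconds beginning `delay` seconds after
    the stimulus trigger; the count in the `window` seconds immediately
    before the trigger is subtracted, and the difference is divided by the
    window length (for a 0.5 s window this is the doubling in the text).
    """
    evoked = count_spikes(spike_times, trigger + delay, trigger + delay + window)
    baseline = count_spikes(spike_times, trigger - window, trigger)
    return (evoked - baseline) / window


def spontaneous_rate(spike_times, start, window=5.0):
    """Spontaneous activity in spikes per second over a stimulus-free window."""
    return count_spikes(spike_times, start, start + window) / window


# Hypothetical recording: 2 spikes before the trigger at t = 1.0 s,
# 6 spikes in the response window.
example = [0.6, 0.8, 1.2, 1.25, 1.3, 1.4, 1.5, 1.6]
print(corrected_response(example, trigger=1.0))
```

Passing the delay as a parameter reflects the statement that it was set per recording session from a control odour response.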

To assess the responses of live flies to volatile compounds, ~100-120 male or 
virgin female flies were placed in a 5 ml glass pipette (witeg Labortechnik). The 
pipette was then closed at each end with cotton gauze, and an air pulse (3 s) was 
blown through it. As controls, an empty pipette was used or a pipette loaded with 
10 µl phenylacetic acid (0.1 µg µl^-1 in paraffin oil, as 1 µl drops on ten individual 
2 mm^2 filter papers). The responses were quantified by counting all of the spikes in 
a 1 s window from 500 ms after the odour stimulus trigger and then subtracting the 
number of spontaneous spikes in a 1 s window before stimulation. To assess the 
responses to D. melanogaster cuticular hydrocarbons, we prepared extracts in 50 µl 
CH2Cl2 as described below (except without the n-C13 standard) and then 
evaporated the solvent under N2 and diluted the extract in paraffin oil to a concentration 
of 10 fly equivalents µl^-1. These extracts (1 µl) (or control stimuli) were placed on 
the tip of a borosilicate glass capillary (World Precision Instruments) and brought 
into close proximity (~5 mm) to the antenna. The changes in spiking frequency 
were quantified by measuring the spontaneous spiking frequency in spikes s^-1 in a 
3 s window before the stimulus was brought close to the antenna and then 
subtracting this value from the spiking frequency in a 3 s window when the stimulus 
was placed close to the antenna. This second 3 s window typically started 3-5 s 
after stimulus placement, a time frame when the responses to phenylacetic acid 
were most robust. 

Extract preparation and chemical analysis. As phenylacetaldehyde and phenylacetic
acid are polar compounds, we used CH₂Cl₂ for all extractions. D. melanogaster
extracts were prepared by incubating five to ten 5-day-old flies for 5 min in vials
containing 50 µl CH₂Cl₂ and 100 ng n-C13 (as an internal standard) at room
temperature. After the flies were removed, the extracts were stored at −20 °C until
analysis. For standard fly food and fruit extracts, 2 g material was crushed in 20 ml
vials containing 4 ml CH₂Cl₂ and 200 ng n-C13 as an internal standard. After vortexing
for 10 min at room temperature, the extracts were filtered through glass wool and
concentrated to ~100 µl under a gentle flow of N₂. The extracts were stored at −20 °C
until analysis.

Because of the difficulty of detecting phenylacetic acid and phenylacetaldehyde 
at low concentration in the extracts, we tested three column types with different 
polarities: a CP-Sil 5 CB (apolar type, 25 m × 0.25 mm internal diameter, 0.12 µm
film thickness; Varian), a CP-Wax 58 CB (polar type, 25 m × 0.25 mm internal
diameter, 0.20 µm film thickness; Varian) and a VF-1ms (medium polarity type,
20 m × 0.15 mm internal diameter, 0.15 µm film thickness; Varian). Only the last
two columns gave satisfactory quantitative results, and these were used for all
subsequent analyses. The extracts were analysed using a GC-MS-QP2010 apparatus
(Shimadzu) in splitless mode, fitted with a VF-1ms (D. melanogaster extracts)
or a CP-Wax 58 CB (standard fly food, banana and prickly-pear extracts) fused
silica capillary column. The columns were held isothermally at 40 °C for 2 min,
then the temperature was programmed to increase at a rate of 3 °C min⁻¹ to
240 °C (CP-Wax 58 CB) or 300 °C (VF-1ms). Helium was used as the carrier
gas at a linear velocity of 47 cm s⁻¹. The injector port was set at 280 °C. The mass
spectrometer was operated at 70 eV, and scanning was performed from 29 to
600 AMU at 0.5 scans s⁻¹. The injection split was opened 1 min after the injection.
Phenylacetic acid and phenylacetaldehyde were identified using their retention 
time and their fragmentation patterns; diagnostic ions were compared with both 
the NIST/EPA/NIH Mass Spectral Library and the mass spectrum of synthetic 
chemical standards (Sigma-Aldrich) analysed under the same conditions. For 
quantitative analyses of phenylacetic acid and phenylacetaldehyde, the response 
factor was determined for each of the compounds and n-C13 at 0.1, 0.5, 1, 5 and 
10 ng on both types of column. For analysis of extracts from flies grown on
minimal medium, late second or young third instar larvae reared on standard 
food medium were washed six times in distilled water to remove any traces of 
medium. Larvae were then transferred to a minimal growth medium (0.1 M sucrose 
and 1% agarose). At emergence, adults were separated by sex and aged in groups of 
20 flies on minimal growth medium for 5 days, before extraction as described above. 
Courtship assays. Male and virgin female flies were collected at eclosion. They 
were kept in vials, individually for subjects or in groups of 10 for objects, for 5-9 
days before behavioural assays. The vials contained fresh food and were kept under 
a 12 h light and 12 h dark cycle at 25 °C. Behavioural assays were performed at
approximately the same time each morning. Single-pair courtship tests were performed
in round observation chambers (1.2 cm in diameter and 0.4 cm deep, with
nine-chamber plexiglass courtship wheels) under far-red light (to eliminate the
contribution of visual cues) at 25 °C. For most experiments, object females (or
males) were decapitated with a clean razor blade (to avoid any reciprocal interaction
during courtship) and were left to ‘recover’ for ~10 min before introduction
into the courtship chamber. We also tested males for their courtship behaviour in 
more natural conditions: using intact live virgin Canton-S females, together with 
~90 mg fly food medium (corresponding to approximately one-quarter of the 
courtship chamber volume). Male courtship behaviour towards these different 
types of object was recorded for 10 min with an HDR-SR10 digital video camera 
(Sony). After recording, videos were analysed by a researcher who was blinded to 
the genotypes, using Annotation software (SaySoSoft). The courtship index was 
calculated as the percentage of time that a male courted the object during a 10-min 
period. Female copulation behaviour was quantified by direct observation for 1 h 
without video recording: copulation latency represents the time to the beginning of 
copulation from the moment that the virgin female and the male are placed in the 
chamber; copulation success is the percentage of flies that copulate; and copulation 
duration is the total time of copulation from the start to the end of copulation. 
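
The behavioural measures defined above reduce to simple ratios. The following is an illustrative sketch only; the function names and the convention of recording a non-copulating pair as `None` are assumptions made for the example and are not described in the source:

```python
from typing import Optional, Sequence


def courtship_index(courting_s: float, observation_s: float = 600.0) -> float:
    """Percentage of the observation period (10 min by default) spent courting."""
    if observation_s <= 0:
        raise ValueError("observation_s must be positive")
    if not 0 <= courting_s <= observation_s:
        raise ValueError("courting_s must lie within the observation period")
    return 100.0 * courting_s / observation_s


def copulation_success(latencies_s: Sequence[Optional[float]]) -> float:
    """Percentage of pairs that copulated.

    Each entry is the copulation latency in seconds, or None (an assumed
    convention) when the pair did not copulate during the observation.
    """
    if not latencies_s:
        raise ValueError("at least one pair is required")
    mated = sum(1 for latency in latencies_s if latency is not None)
    return 100.0 * mated / len(latencies_s)
```

A male that courts for 150 s of the 600 s recording would have a courtship index of 25.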

For perfumed assays, the courtship chamber was placed on top of an identical 
chamber, and the chambers were separated by muslin gauze to avoid direct contact 
between the flies in the top chamber and the chemical stimuli in the bottom 
chamber (food, 50 ± 2 mg standard fly food; solvent, 10 µl paraffin oil (on a filter
paper or aluminium foil pad); phenylacetic acid, 10 µl ~35 µg µl⁻¹ solution in
paraffin oil (that is, ~2.5 µmol total odour); ethyl butyrate, 10 µl 1% (v/v) solution
in paraffin oil (that is, ~0.8 µmol total odour)). Single subject males were placed in
the upper chamber with a killed intact virgin wild-type female at 25 °C. The female
had been anaesthetized with CO₂, placed in an Eppendorf tube and killed by
submerging the tube in liquid nitrogen. The female was then warmed for
~10 min at 25 °C in a Petri dish before being placed in the courtship chamber.
Males court killed objects only at low levels, facilitating the determination of the 
enhancing effect of perfuming. 

Olfactory assays. Male olfactory responses were tested using an adapted Y-maze
assay; in brief, three-way plastic tube connectors (Reactolab) were used to join
three glass vials by way of 1 ml pipette tips that pierced the cotton vial tops,
forming a tightly sealed Y maze. The narrow ends of the pipette tips were cut
and oriented in each vial top to form one ‘loading’ vial and two ‘trap’ vials, which
contained 40 µl odour (10% (v/v) acetic acid in water (that is, ~70 µmol total
odour) or ~35 µg µl⁻¹ phenylacetic acid in paraffin oil (that is, ~10 µmol total
odour)) or 40 µl corresponding solvent on filter paper. Ten 5-9-day-old males
(starved for 18-21 h in glass tubes with water on filter paper) were introduced
without CO₂ anaesthesia into the loading vial and were allowed 2 h at 25 °C to
choose to enter the trap vial containing the odour or the solvent. The resultant
olfactory index was obtained with the following formula: (number in the odour
tube − number in the solvent tube)/total number of loaded males.
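
The olfactory index formula can be written as a short function. This is a sketch with hypothetical names; the input checks are additions for the example and are not part of the published protocol:

```python
def olfactory_index(n_odour: int, n_solvent: int, n_loaded: int) -> float:
    """(number in odour tube - number in solvent tube) / total loaded males.

    Ranges from -1 (all males in the solvent trap) to +1 (all males in the
    odour trap); males remaining in the loading vial pull the index towards 0.
    """
    if n_loaded <= 0:
        raise ValueError("n_loaded must be positive")
    if n_odour < 0 or n_solvent < 0 or n_odour + n_solvent > n_loaded:
        raise ValueError("trap counts must be non-negative and not exceed n_loaded")
    return (n_odour - n_solvent) / n_loaded
```

With ten loaded males, six in the odour trap and two in the solvent trap, the index is (6 − 2)/10 = 0.4.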

Projection neuron analysis. Projection neuron images (275) from four different 
sources were brought into a common reference space for analysis. The
common template (Cell07) was from ref. 22, so 236 projection neuron images 
from that study did not require further processing. The anti-Discs large channel of 
34 confocal images from ref. 38 was registered to a single female template brain 
(FlyCircuit ID 6475, FC6475), which was chosen for good quality staining and 
imaging, using the CMTK toolkit (http://www.nitrc.org/projects/cmtk) as 
described previously. The FC6475 template brain was then registered to the
Cell07 brain using landmarks registration based on 24 manually chosen anatomical
landmarks visible in both anti-Discs large and nc82 staining. This landmarks
registration used the pnreg command of the IRTK toolkit
(http://www.doc.ic.ac.uk/~dr/software), which uses nonlinear third-order B-spline
registration. The fiducial registration error, measured as the root mean squared
deviation, was 5.1 µm after affine registration and 2.4 µm after warping registration,
corresponding to per axis accuracies of 2.9 and 1.4 µm, respectively. The final per
axis registration accuracy for independent landmarks (not used during the
registration) was 3.3 µm, which is comparable to that of previous studies.
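
The per axis figures quoted above are consistent with dividing the three-dimensional root mean squared deviation by √3, that is, with assuming that the error is spread equally over the x, y and z axes. That assumption is an inference from the reported numbers rather than something stated in the text; a minimal sketch under it:

```python
import math


def per_axis_error(rmsd_3d: float, n_axes: int = 3) -> float:
    """Per-axis error implied by a 3D RMSD, assuming equal error on each axis.

    If the squared error is shared equally among n_axes axes, then
    rmsd_3d**2 = n_axes * per_axis**2, so per_axis = rmsd_3d / sqrt(n_axes).
    """
    if n_axes <= 0:
        raise ValueError("n_axes must be positive")
    return rmsd_3d / math.sqrt(n_axes)
```

Under this assumption, 5.1 µm and 2.4 µm give roughly 2.9 µm and 1.4 µm per axis, matching the values reported for the affine and warping registrations.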

Three VL2a projection neuron images from ref. 37 were registered to the Cell07 
template by choosing 12-14 landmarks in the nc82 channel present in each stack. 
Landmarks registration used the pnreg command of IRTK. Two VL2a projection 
neuron images from ref. 36 were registered to the Cell07 template using 10-14 
landmarks. Because those brains had weak nc82 staining, the only reliable marker
was GH146-GAL4-driven expression of a CD2 reporter. We therefore used a
two-channel confocal image of an nc82-stained brain with GH146-driven mCD8:GFP
expression that had been registered by way of its nc82 channel to the Cell07
template to help choose landmarks. Landmarks registration again used the pnreg
command of IRTK. The fiducial registration error (root mean squared deviation)
for the five brains varied between 3.6 and 4.9 µm (affine) and 1.8 and 3.1 µm
(warp).

Neuronal tracing was carried out in Amira (http://www.amira.com) with the 
hxskeletonize plug-in. The three-dimensional coordinates of neuronal tracings
were then transformed from their respective original image coordinates using the
CMTK gregxform tool (FlyCircuit images → FC6475 template) and the IRTK
ptransformation tool (FC6475 → Cell07, three VL2a images → FC6475). The
underlying tools were called using custom code written in R
(http://www.r-project.org), which was also used to analyse the transformed
neuronal tracings (using the AnalysisSuite R package). The final visualization was
carried out in Amira
using Tcl scripts to load, show/hide and colour neuron tracings and to take 
snapshots. R and Tcl source code, landmark files, template brain images and 
neuronal tracings will be available on publication at the Jefferis laboratory website 
(http://flybrain.mrc-lmb.cam.ac.uk). 

Analysis of axonal overlap. The degree of overlap between the axon terminals of 
different projection neurons was measured by making use of the three-dimensional 
convex hull of each terminal (using the Qhull library as exposed by the R geometry
package). It may help to note that for a set of two-dimensional points in the plane,
the convex hull can be obtained by stretching an elastic band to encompass the
whole object and then releasing it. To obtain an overlap score for a projection
neuron pair (A,B), we calculated three convex hulls, H(A), H(B) and H(A,B), where
H(A,B) is the convex hull for all of the points in A and B. We then calculated a
normalized overlap score as s(A,B) = (H(A) + H(B) − H(A,B))/(H(A) + H(B)).

If A and B are identical (that is, complete overlap), the score will be 0.5. Less 
overlap will result in lower scores, with negative scores when the amount of 
intervening space between the axon terminals exceeds the overlap (if any). We 
first calculated an overlap score for all 275 neurons in the data set. We then 
aggregated the scores for all pairwise combinations of the 46 projection neuron 
classes in the data set and calculated the median score for each combination. This 
resulted in a symmetrical distance matrix with 46 × 45/2 = 1,035 unique
off-diagonal entries. The median overlap score for VL2a and VA1lm projection
neurons was 0.224, which was at the 97.7th centile: that is, the overlap between
these two classes was among the strongest 2.3% in the data set. VL2a was the
neuronal class with the strongest overlap with VA1lm (and vice versa). Cluster
analysis of these data was performed with Ward’s method, using the hclust
function of R. Overlap scores, s, were converted to a distance, d, suitable for
clustering by the simple transform d = 0.5 − s.
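
The overlap score and its conversion to a clustering distance can be sketched as follows. This is not the authors' R code: the function names are hypothetical, and the example assumes that H(·) in the formula stands for the volume of the corresponding convex hull, which is taken here as a precomputed input rather than derived from the tracing points.

```python
def overlap_score(vol_a: float, vol_b: float, vol_ab: float) -> float:
    """Normalized overlap s(A,B) = (H(A) + H(B) - H(A,B)) / (H(A) + H(B)).

    vol_a and vol_b are the hull volumes of each terminal; vol_ab is the
    volume of the hull enclosing the points of both terminals.
    """
    total = vol_a + vol_b
    if total <= 0:
        raise ValueError("hull volumes must sum to a positive value")
    return (total - vol_ab) / total


def overlap_distance(score: float) -> float:
    """Distance used for clustering: d = 0.5 - s, so identical terminals give 0."""
    return 0.5 - score


def n_class_pairs(n_classes: int) -> int:
    """Number of unique off-diagonal entries in a symmetrical class-by-class matrix."""
    return n_classes * (n_classes - 1) // 2
```

Two identical terminals give a score of 0.5 and a distance of 0, whereas two unit-volume terminals whose joint hull has volume 3 give a score of −0.5; 46 classes yield 1,035 class pairs.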

The VL2a projection neuron image data were obtained in separate studies from
the bulk of the projection neuron data, so it is natural to ask whether this could
have had some effect on our analysis. However, the consistent results for VL2a 
projection neuron images obtained from two different laboratories argue very 
strongly against this possibility. Visual inspection indicated that there was excel- 
lent overlap between the VL2a neurons from the two sources, and the pairwise 
overlap scores between and within the two groups were almost identical (median 
0.284 and median 0.301, corresponding to the 98.0th and 98.8th centiles in 
Supplementary Fig. 4b). As an additional check, we repeated the axon overlap 
analysis and clustering with affine registered VL2a neurons (affine registration is 
more robust, although it is less accurate in the ideal situation). This had a negligible 
effect on the VL2a/VA1lm median overlap score (0.197 and 0.224, corresponding
to the 96.2th centile and 97.7th centile, respectively) and had no effect on the
clustering results (G.S.X.E.J., data not shown).
Phylogenetic analysis. The phylogenetic tree of selected antennal-expressed IR
orthologous groups was constructed using PhyML and is based on the most
conserved columns of an amino acid alignment generated with MUSCLE. The
phylogeny was rooted using IR25a. iTOL was used to view and graphically edit
the tree.

The innate immune system detects infection by using germline-encoded
receptors that are specific for conserved microbial molecules.
The recognition of microbial ligands leads to the production of
cytokines, such as type I interferons (IFNs), that are essential for
successful pathogen elimination. Cytosolic detection of pathogen-derived
DNA is one major mechanism of inducing IFN production,
and this process requires signalling through TANK binding
kinase 1 (TBK1) and its downstream transcription factor,
IFN-regulatory factor 3 (IRF3). In addition, a transmembrane protein
called STING (stimulator of IFN genes; also known as MITA, ERIS,
MPYS and TMEM173) functions as an essential signalling adaptor,
linking the cytosolic detection of DNA to the TBK1-IRF3 signalling
axis. Recently, unique nucleic acids called cyclic dinucleotides,
which function as conserved signalling molecules in bacteria, have
also been shown to induce a STING-dependent type I IFN response.
However, a mammalian sensor of cyclic dinucleotides
has not been identified. Here we report evidence that STING itself
is an innate immune sensor of cyclic dinucleotides. We demonstrate
that STING binds directly to radiolabelled cyclic diguanylate
monophosphate (c-di-GMP), and we show that unlabelled cyclic
dinucleotides, but not other nucleotides or nucleic acids, compete
with c-di-GMP for binding to STING. Furthermore, we identify
mutations in STING that selectively affect the response to cyclic
dinucleotides without affecting the response to DNA. Thus,
STING seems to function as a direct sensor of cyclic dinucleotides,
in addition to its established role as a signalling adaptor in the IFN
response to cytosolic DNA. Cyclic dinucleotides have shown promise
as novel vaccine adjuvants and immunotherapeutics, and our
results provide insight into the mechanism by which cyclic
dinucleotides are sensed by the innate immune system.

Nucleotides are crucial signalling molecules in all domains of life, but
cyclic dinucleotides seem to be produced solely by Bacteria and
Archaea. For example, c-di-GMP is a ubiquitous second messenger
that regulates biofilm formation, motility and virulence in a diverse
range of bacterial species. Recently, cyclic diadenylate monophosphate
(c-di-AMP) was discovered to be a bacterial regulatory molecule,
although its role remains to be fully characterized. Because they are
unique to microorganisms, cyclic dinucleotides are appropriate targets
for immune recognition. Indeed, the induction of IFN production by
Listeria monocytogenes depends on bacterial secretion of
cyclic-di-AMP. However, it remains unclear how cyclic dinucleotides are
sensed in mammalian cells.

To address the mechanism by which mammalian cells sense cyclic 
dinucleotides, we first confirmed that cyclic dinucleotides are detected 
in the host cell cytosol by expressing RocR, a c-di-GMP-specific
phosphodiesterase from Pseudomonas aeruginosa, in the cytosol of 
macrophages. In these cells, IFN induction in response to c-di-GMP 
(but not other stimuli) is tenfold lower than that in control, vector- 
transduced, cells (Fig. 1a), confirming that the cytosolic presence of 
c-di-GMP is important for inducing IFN. 


To identify candidate cyclic dinucleotide sensors, we sought to 
identify molecules that could reconstitute the IFN response to cyclic 
dinucleotides in HEK293T cells, which do not respond to c-di-GMP.
Because STING is essential for the IFN response to cyclic dinucleotides
and because STING expression is low or undetectable in
HEK293T cells (Supplementary Fig. 1 and data not shown), we first
expressed STING in HEK293T cells. The overexpression of STING
spontaneously induces an IFN reporter, so we transfected a small
amount of Sting-encoding vector that by itself was insufficient to
induce IFN. Low levels of STING protein were sufficient to reconstitute
the responsiveness of HEK293T cells to c-di-GMP (Fig. 1b) and
c-di-AMP (Fig. 1c). By contrast, the non-functional goldenticket (gt)
allele of Sting (which results in a STING protein in which asparagine
has been substituted for isoleucine, I199N) did not restore
responsiveness to c-di-GMP (Fig. 1b). Interestingly, the expression of
wild-type STING did not reconstitute the responsiveness of HEK293T cells
to double-stranded DNA (dsDNA) oligonucleotides (for example, a
70-base-pair oligonucleotide from vaccinia virus (VV70mer) or
IFN-stimulatory DNA (ISD)) that had previously been shown to induce
type I IFNs in macrophages through STING (Fig. 1b and
Supplementary Fig. 2a). By contrast, the induction of IFN by
poly(dA-dT)·poly(dT-dA) DNA (denoted poly(dAT:dTA)) was identical in
cells that were transfected with wild-type Sting and those transfected
with gt, demonstrating that the RNA polymerase III DNA-sensing
pathway is intact in these cells and is not responsible for the
detection of c-di-GMP (Fig. 1d). As a positive control, Myd88−/− Trif−/−
immortalized macrophages, which express STING, responded
similarly to c-di-GMP, poly(dAT:dTA), VV70mer and ISD (Fig. 1e
and Supplementary Fig. 2b). Together, our results show that STING 
expression is sufficient to restore the responsiveness of HEK293T cells 
to cyclic dinucleotides but not to DNA. 

We next tested whether STING, or perhaps another protein in 
HEK293T cells, binds to c-di-GMP. We used HEK293T cell lysates 
in an in vitro ultraviolet radiation crosslinking assay to identify putative 
sensor proteins that interact directly with radiolabelled c-di-GMP 
(c-di-[³²P]GMP). We expected to identify directly interacting proteins
because only molecules within bond-length proximity are efficiently 
crosslinked by ultraviolet radiation. We detected a prominent ~40-kDa
radiolabelled protein, which corresponds to the predicted
molecular weight of monomeric STING, in the lysates of cells
transfected with a vector encoding haemagglutinin (HA)-tagged STING
(STING-HA) but not in the lysates of cells transfected with a vector
encoding STING(I199N)-HA or vector only (Fig. 2a). The ~40-kDa
band did not appear when the same lysates were crosslinked in the
presence of [³²P]GTP, implying that crosslinking to c-di-[³²P]GMP
was specific (Fig. 2a). We also observed an ~80-kDa species, which
might correspond to a previously reported STING dimer (Fig. 2b).
To test the hypothesis that STING crosslinks with c-di-[³²P]GMP,
we immunoprecipitated STING from transfected HEK293T
cells and performed the c-di-[³²P]GMP crosslinking assay on the
immunoprecipitates. Bands corresponding to the molecular weight of
the STING monomer and dimer were identified only in immuno- 
precipitates from lysates overexpressing STING and not in mock 
immunoprecipitates from lysates of vector-only-transfected cells 
(Fig. 2b). Thus, STING seems to bind to c-di-GMP. 

To confirm that the binding of c-di-[³²P]GMP to STING is specific,
we performed the c-di-GMP crosslinking assay in the presence of
unlabelled nucleotides. Unlabelled c-di-GMP and c-di-AMP specifically
competed with c-di-[³²P]GMP for binding to STING (Fig. 2c,
d). By contrast, GTP, other guanosine derivatives and nucleic acids
(including dsDNA) competed away nonspecific binding (Fig. 2c, d,
asterisk); however, under our specific assay conditions, these molecules
could not compete efficiently with c-di-[³²P]GMP for binding to
STING (Fig. 2c, d, arrow). Because the cell cytosol contains high
concentrations of GTP (0.1-1 mM), a putative c-di-GMP sensor must
have a high degree of specificity for c-di-GMP over GTP. We found
that c-di-GMP efficiently crosslinked to STING even in the presence of
1 mM GTP (Fig. 2c).

Figure 1 | STING is sufficient to restore responsiveness to cyclic
dinucleotides. a, Immortalized Myd88−/− Trif−/− macrophages were
transduced with retrovirus expressing rocR and then stimulated for 6 h. IFN
induction was measured by quantitative PCR with reverse transcription
(quantitative RT-PCR) and normalized to ribosomal protein S17 (Rps17)
expression levels. b-d, HEK293T cells were transfected as indicated, together
with an IFN-luciferase reporter, and luciferase activity was measured 6 h after
stimulation. e, Myd88−/− Trif−/− macrophages were stimulated for 6 h, and
IFN induction was measured as in a. a-e, Data are presented as mean ± s.d.
(n = 3) and are representative of at least three independent experiments. ***,
P < 0.001. SeV, Sendai virus; TMEV, Theiler’s murine encephalomyelitis virus;
unstim, unstimulated; VV70mer, a stimulatory dsDNA oligonucleotide derived
from vaccinia virus.

Although these data imply that STING directly and specifically
binds to cyclic dinucleotides, they do not address whether other host
proteins might also be required. STING is predicted to contain an
amino-terminal domain with multiple transmembrane segments,
followed by a globular carboxy-terminal domain (CTD). Because the
CTD contains the amino acid substitution that abolishes STING
function in gt mice (I199N), we suspected that the CTD might be involved
in binding to cyclic dinucleotides. Thus, we subjected purified
recombinant His₆-tagged STING CTD (amino acids 138-378) (Fig. 2e) to the
c-di-[³²P]GMP crosslinking assay. We found that the recombinant
CTD of STING bound to c-di-[³²P]GMP and that this binding
was specifically competed away with cold (unlabelled) c-di-GMP or
c-di-AMP but not with cold GTP or ATP (Fig. 2f). We used equilibrium
dialysis to obtain an estimate of the affinity (dissociation constant,
Kd) of c-di-GMP binding to the STING CTD, which was ~5 µM
(Fig. 2g). In its native membrane-bound form, or in complex with
other host factors, STING may have a stronger affinity for
c-di-GMP; nevertheless, a 5 µM affinity is consistent with the dose response
that has previously been observed in macrophages. Consistent with the
ability of STING to dimerize, the binding data suggest a stoichiometry
of one molecule of c-di-GMP per two molecules of STING.

Figure 2 | STING binds cyclic dinucleotides. a, HEK293T cells were
transfected as indicated, and the cell lysates were subjected to an in vitro
ultraviolet radiation crosslinking assay with c-di-[³²P]GMP. Samples were
separated by SDS-PAGE and visualized by autoradiography or western
blotting. b, HEK293T cells were transfected as in a, and anti-HA
immunoprecipitates were stained with colloidal blue or subjected to the
crosslinking assay with c-di-[³²P]GMP. c, d, HEK293T cells were transfected as
in a, and the cell lysates were crosslinked with ultraviolet radiation to
c-di-[³²P]GMP or [³²P]GTP in the presence of cold competing nucleotides in
tenfold serial dilutions beginning at 1 mM (as indicated by the wedges), except
for guanosine (beginning at 0.1 mM), VV70mer (500 µg ml⁻¹),
poly(dAT:dTA) (50 µg ml⁻¹) and poly(I)·poly(C) (denoted poly(I:C))
(50 µg ml⁻¹). The arrow indicates STING, and the asterisk indicates a
nonspecific band. nt, nucleotides; ppGpp, guanosine-3′,5′-bisdiphosphate.
e, His₆-STING (amino acids 138-378) (1 µg) was separated by SDS-PAGE and
stained with Coomassie blue. f, His₆-STING 138-378 was analysed as in
c. g, The binding of c-di-GMP to purified His₆-STING (10 µM) was measured
by equilibrium dialysis. Bmax, maximum number of binding sites; h, Hill
coefficient; Kd, dissociation constant. a-g, Data are representative of three
independent experiments.
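
The Fig. 2 legend reports a maximum number of binding sites (Bmax), a Hill coefficient (h) and a dissociation constant (Kd) for the equilibrium dialysis data, but this section does not state the exact equation that was fitted. As a sketch only, assuming a standard Hill-type saturation model in which Kd is the free ligand concentration at half-maximal binding (the function name and parameterization are hypothetical):

```python
def hill_bound(free_um: float, bmax: float, kd_um: float, h: float = 1.0) -> float:
    """Bound ligand predicted by an assumed Hill-type binding model.

    bound = Bmax * L**h / (Kd**h + L**h), where L is the free ligand
    concentration in micromolar. Other parameterizations of the Hill
    equation exist; this form is an assumption, not the authors' stated fit.
    """
    if free_um < 0 or kd_um <= 0 or h <= 0:
        raise ValueError("free_um must be >= 0; kd_um and h must be > 0")
    ligand_term = free_um ** h
    return bmax * ligand_term / (kd_um ** h + ligand_term)
```

Under this model, a free c-di-GMP concentration equal to Kd (about 5 µM here) gives half of Bmax regardless of the value of h.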

To identify the amino acids involved in c-di-GMP binding and/or 
IFN induction, we introduced point mutations into STING. Focusing
on clusters of conserved and charged residues, we mutated 67 amino
acids, either individually or in groups, and we classified these mutants
into five categories (Fig. 3, Supplementary Table 1 and Supplementary
Figs 3 and 4). Class I consists of mutants that abolish both binding and
IFN induction (Fig. 3a-c, red, and Supplementary Table 1). Class II
mutants bind to c-di-GMP but fail to induce IFN (Fig. 3c, purple).
Class III comprises ‘hyperactive’ mutants, which spontaneously induce
IFN at low levels of transfection (Fig. 3a-c, green, and Supplementary
Table 1). Class IV mutants induce IFN when overexpressed but do not
respond to c-di-GMP (Fig. 3a-c, blue, and Supplementary Table 1).
Class V consists of mutants that have no effect on c-di-GMP binding or
IFN induction (Fig. 3c, yellow, and Supplementary Table 1). Although
mutating STING can result in diverse phenotypes, a key finding is that
all mutants that failed to bind to c-di-GMP also lost the ability to
induce IFN in response to c-di-GMP. Consistent with our observation
that the CTD is sufficient for binding to c-di-GMP (Fig. 2f), all
mutations that affected c-di-GMP binding were located within the CTD.

Figure 3 | Mutational analysis of STING. a, HEK293T cells were transfected
with vectors encoding STING and STING mutants as indicated, together with
an IFN-luciferase reporter, and luciferase activity was measured 6 h after
stimulation. Data are presented as mean ± s.d. (n = 3). b, HEK293T cells were
transfected as in a, except that the lysates were subjected to the crosslinking
assay with c-di-[³²P]GMP as in Fig. 2a. c, Organization of STING based on the
membrane topology prediction programs SOSUI, TMHMM, HMMTOP and
TMpred. Strongly predicted transmembrane domains are boxed; weakly
predicted transmembrane domains have dashed boxes. Coloured residues
indicate mutant classes (see text for description of mutants): class I (red), class II
(purple), class III (green), class IV (blue) and class V (yellow). Bracketed
mutations were made in combination. a-c, Data are representative of at least
three independent experiments.

DNA and cyclic dinucleotides induce indistinguishable transcriptional
responses in macrophages, and STING seems to be essential
for both responses. However, we found that STING expression is
insufficient to restore the responsiveness of HEK293T cells to DNA, in 
contrast to cyclic dinucleotides (Fig. 1b and Supplementary Fig. 2a). 
Moreover, our competition assays indicate that DNA does not com- 
pete with cyclic-di-GMP for binding to STING under the conditions 
tested (Fig. 2d). Thus, although our data indicate that STING functions 
as a direct immunosensor of cyclic dinucleotides, it seems likely that 
additional host proteins are involved in IFN induction by DNA. 
Indeed, two candidate DNA sensors, DAI (also known as ZBP1) and 
IFI16, have been identified, neither of which seems to be essential
for the response to cyclic dinucleotides (ref. 10 and data not shown).
To determine whether the responsiveness to cyclic dinucleotides and
DNA are separable functions of STING, we sought to identify STING
mutants that fail to respond to cyclic dinucleotides but still respond to
DNA. We identified a STING mutant (R231A) that was unresponsive
to c-di-GMP (Fig. 4a), although it induced IFN when overexpressed
(Fig. 4a) and bound to c-di-GMP (Fig. 4b). Interestingly, STING
R231A was able to restore the responsiveness of gt bone marrow
macrophages to DNA but not to cyclic-di-GMP (Fig. 4c). Thus, cyclic
dinucleotide sensing and DNA sensing can be uncoupled, suggesting
that these two pathways are discrete but share STING as a common
signalling molecule. It is unexpected that STING would function both
as a direct immunosensor (of cyclic dinucleotides) and as a signalling
adaptor (in the response to DNA). One possibility is that STING
initially evolved as a cyclic dinucleotide sensor and was subsequently
co-opted for DNA sensing.

Figure 4 | The IFN response to DNA and c-di-GMP can be uncoupled.
a, HEK293T cells were transfected as indicated, together with an IFN-luciferase
reporter, and IFN reporter activity was measured 6 h after stimulation.
b, HEK293T cells were transfected as in a, except that the lysates were subjected
to the crosslinking assay with c-di-[³²P]GMP as in Fig. 2a. c, Bone-marrow-derived
macrophages from Sting-deficient (gt) mice were transduced with the
indicated constructs. IFN induction in response to transfected cyclic-di-GMP
or VV70mer was measured by quantitative RT-PCR and normalized to Rps17.
**, P < 0.001; NS, not significant, P = 0.1205. a, c, Data are presented as
mean ± s.d. (n = 3). a-c, Data are representative of at least three independent
experiments.

We previously used mouse mutagenesis to identify STING as an 
essential molecule in the in vivo IFN response to cyclic dinucleotides.
The requirement for STING can now be rationalized by our proposal 
that STING functions as a direct sensor of cyclic dinucleotides. 
Interestingly, STING does not share homology with any known 
immunosensor and therefore seems to represent a novel category of 
microbial detector. Although a BLAST search of the mouse proteome 
for homologues of the L. monocytogenes diadenylate cyclase (lmo2120;
also known as DacA) identifies STING as the top hit, the homology is 
limited to a short region of the STING CTD (amino acids 311-358). 
STING does not seem to share homology with PilZ-domain-containing 
proteins, which function as c-di-GMP receptors in bacteria®. Structural 
studies are required to better characterize the interaction of STING with 
cyclic dinucleotides and to determine whether STING resembles any 
known protein in mammals or bacteria. 

Numerous studies have demonstrated that cyclic dinucleotides are 
potent immunostimulatory compounds that may be valuable as novel 
immunotherapeutics or adjuvants. The therapeutic development of
cyclic dinucleotides will be greatly facilitated by an improved under- 
standing of the mechanism by which they are sensed. Furthermore, 
our finding that STING is a direct detector of cyclic dinucleotides 
provides insight into the fundamental mechanisms by which the 
innate immune system can detect bacterial infection. 


METHODS SUMMARY 

Transfections. Transfections were carried out using Lipofectamine 2000 
(Invitrogen) according to the manufacturer’s instructions. Digitonin permeabili- 
zation was used to introduce c-di-AMP into cells as described previously’’. 
Recombinant STING. DNA encoding the CTD of mouse STING (nucleotides 
414-1,137) was cloned into the vector pET28a for recombinant protein expression 
in Escherichia coli. 

Ultraviolet radiation crosslinking. c-di-[32P]GMP was enzymatically synthesized
using recombinant WspR and was used in an ultraviolet radiation crosslinking
assay as described previously’. Briefly, 50 µg HEK293T cell lysate at a final
concentration of 2 µg µl−1, or 1 µg recombinant His-tagged STING, was incubated
with 2 µCi c-di-[32P]GMP in binding buffer (20 mM Tris-HCl, pH 7.4, 200 mM
NaCl and 1 mM MgCl2) for 15 min at 25 °C. The reactions were irradiated at
254 nm, and the proteins in the samples were then separated by SDS-PAGE. 
Statistical analysis. Statistical differences were calculated with an unpaired two- 
tailed Student’s t-test using Prism 5.0b software (GraphPad). 
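The test statistic behind an unpaired Student's t-test can be sketched as follows. This is a minimal illustration under stated assumptions, not the Prism procedure itself: the function name and the triplicate values are hypothetical, and the pooled (equal-variance) form is assumed. The sketch stops at the t statistic and degrees of freedom, because the two-tailed P value requires the t-distribution, which the Python standard library does not provide.

```python
from math import sqrt
from statistics import mean, variance


def unpaired_t(a, b):
    """Return (t, degrees of freedom) for Student's unpaired t-test (pooled variance)."""
    na, nb = len(a), len(b)
    dof = na + nb - 2
    # Pooled variance weights each sample variance by its own degrees of freedom.
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / dof
    standard_error = sqrt(pooled_var * (1 / na + 1 / nb))
    return (mean(a) - mean(b)) / standard_error, dof


# Hypothetical triplicate measurements (n = 3 per group); not data from the paper.
control = [1.0, 2.0, 3.0]
treated = [2.0, 3.0, 4.0]
t_stat, dof = unpaired_t(control, treated)
```

With n = 3 per group the test has only 4 degrees of freedom, so a large t statistic is needed to reach a small two-tailed P value.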


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Synthesis of c-di-[32P]GMP. The synthesis of c-di-[32P]GMP was carried out as
described previously”'. Briefly, recombinant His6-tagged WspR was incubated
with [α-32P]GTP (3,000 Ci mmol−1, 10 µCi µl−1, Amersham Biosciences) for
2 h at 25 °C, followed by heat inactivation of WspR at 95 °C for 5 min. Residual
*°p was removed by incubation with calf intestinal phosphatase (CIP) (New 
England Biolabs) for 10 min at 37 °C. CIP was heat inactivated at 95 °C for
5 min, followed by centrifugation at 16,000g for 5 min. The [32P]GTP used as a
negative control was prepared identically except that His-tagged WspR and CIP
were omitted from the preparation. Radiolabelled nucleotides were quantified by
separation by thin-layer chromatography on cellulose-PEI plates (Macherey-
Nagel) using 1.5 M KH2PO4, pH 3.65 (Supplementary Fig. 5).

Cell lines and animals. C57BL/6 Myd88−/− Trif−/− mice were obtained from G.
Barton, and immortalized macrophages were generated as described previously”. 
Immortalized bone marrow macrophages were maintained in RPMI 1640 
(Invitrogen) supplemented with 10% FBS, penicillin-streptomycin and glutamine. 
HEK293T cells were maintained in DMEM supplemented with 10% FBS, penicillin- 
streptomycin and glutamine. Animal use was approved by the Animal Care and Use 
Committee at the University of California, Berkeley. 

Plasmids. A construct encoding RocR (NP_252636) from P. aeruginosa’? was a gift 
from S. Lory. rocR was cloned into the MSCV2.2 retroviral expression construct 
upstream of an internal ribosome entry site (IRES)-green fluorescent protein
(GFP). MSCV-rocR was transduced into immortalized macrophages from
Myd88−/− Trif−/− mice, and cells were sorted for GFP expression. Mouse Sting
and the gt (I199N) mutant allele of Sting were cloned into the vector pcDNA3 with
a C-terminal HA tag as described previously''. DNA encoding the CTD of mouse
Sting (nucleotides 414-1,137) was cloned into pET28a for recombinant protein
expression in Escherichia coli.

Site-directed mutagenesis. Mutations in Sting were generated using the 
QuikChange Site-Directed Mutagenesis Kit (Stratagene) according to the manu- 
facturer’s guidelines. 

Reagents. c-di-GMP was synthesized as described previously”. Purified c-di-AMP 
was a gift from J. Woodward and D. Portnoy. Poly(dAT:dTA), GTP, ATP, GMP and 
guanosine were obtained from Sigma-Aldrich. Poly(I)·poly(C) (denoted poly(I:C))
was purchased from Invivogen. Guanosine-3’,5’-bisdiphosphate (ppGpp) was 
obtained from TriLink Biotechnologies. Sendai virus was purchased from Charles 
River Laboratories. Theiler’s murine encephalomyelitis virus (TMEV) strain GDVII 
was provided by M. Brahic and E. Freundt. DNA oligonucleotides corresponding to 
the VV70mer and ISD were purchased from Elim Biopharmaceuticals and were 
annealed as described previously*”*. 

Cell stimulation. All transfections (except for c-di-AMP) were carried out
using Lipofectamine 2000 (Invitrogen) according to the manufacturer’s instruc-
tions. The VV70mer was transfected at a final concentration of 0.5 µg ml−1.
Poly(dAT:dTA), poly(I:C) and c-di-GMP were transfected at a final concentration
of 4 µg ml−1. c-di-AMP was used at a final concentration of 5.4 mM, and stimu-
lation was performed using digitonin permeabilization as described previously’’.
Luciferase assay. HEK293T cells were plated in TC-treated 96-well plates at 
0.5 X 10° cells ml}. The next day, the cells were transfected as indicated, together 
with IFN-β-firefly luciferase and TK-Renilla luciferase reporter constructs.
Following stimulation for 6h with the indicated ligands, the cells were lysed in 
passive lysis buffer (Promega) for 5 min at 25°C. The cell lysates were incubated 
with firefly luciferase substrate (Biosynth) and the Renilla luciferase substrate 
coelenterazine (Biotium), and luminescence was measured on a SpectraMax L 
microplate reader (Molecular Devices). The relative Ifnb expression was calculated 
as firefly luminescence relative to Renilla luminescence. 
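The normalization described above can be sketched as follows. This is a minimal illustration rather than the authors' analysis code: the function names and the luminescence readings are hypothetical, and averaging the per-well ratios across replicate wells is an assumption.

```python
from statistics import mean


def relative_reporter_activity(firefly, renilla):
    """Firefly luminescence relative to Renilla luminescence for a single well."""
    if renilla <= 0:
        raise ValueError("Renilla luminescence must be positive")
    return firefly / renilla


def mean_relative_activity(wells):
    """Mean firefly/Renilla ratio over replicate wells (assumed averaging scheme)."""
    return mean(relative_reporter_activity(f, r) for f, r in wells)


# Hypothetical (firefly, Renilla) readings in arbitrary luminescence units.
replicate_wells = [(5000.0, 1000.0), (6000.0, 1000.0), (7000.0, 1000.0)]
relative_ifnb = mean_relative_activity(replicate_wells)
```

Dividing by the constitutive TK-Renilla signal corrects each well for differences in transfection efficiency and cell number before wells are compared.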

Quantitative PCR. The analysis of Ifnb expression by bone marrow macrophages 
was conducted as described previously”’. 

Preparation of HEK293T cell lysates and immunoprecipitations. HEK293T 
cells were plated at a density of 1 X 10° cells well” ' in a 6-well plate. The following 
day, the cells were transfected with pcDNA3 or pcDNA3 expressing HA-tagged 
wild-type or mutant STING using Lipofectamine 2000 (Invitrogen). The day after
that, the cells were rinsed once with PBS and transferred to Eppendorf tubes in PBS 
containing 1 mM EDTA. The cells were pelleted briefly by centrifugation at 1,000g 
at 4°C. The cell pellet was lysed in an equal volume of digitonin lysis buffer (0.5% 
digitonin, 20 mM Tris-HCl, pH 7.4, and 150 mM NaCl) containing protease inhi- 
bitors (Roche) for 10 min on ice. The cell lysates were centrifuged at 10,000g for 
10 min at 4 °C. The protein concentration in the resultant supernatant was mea-
sured using the Bradford reagent (Bio-Rad). The cell lysates were subjected to a
c-di-GMP binding (crosslinking) assay (see below). The lysates were separated by 
SDS-PAGE, and the separated proteins were transferred to a nitrocellulose mem- 
brane, which was then probed with rat anti-HA antibodies (Roche), to confirm 
STING-HA expression, and mouse anti-β-actin antibodies (Santa Cruz
Biotechnology). To immunoprecipitate HA-tagged STING, the cell lysates were
prepared similarly in digitonin lysis buffer and incubated with anti-HA-antibody-
conjugated agarose beads (Sigma) for 2 h at 4 °C. Washed beads were subjected to
a c-di-GMP binding assay or separated by SDS-PAGE and stained with colloidal 
blue protein stain (Thermo Scientific). 

c-di-GMP binding assays. The c-di-GMP binding assay (also called the cross-
linking assay) was based on a method described previously~’. Briefly, 50 µg
HEK293T cell lysate at a final concentration of 2 µg µl−1, or 1 µg recombinant
His6-tagged STING, was incubated with 2 µCi radiolabelled nucleotide in binding
buffer (20 mM Tris-HCl, pH 7.4, 200 mM NaCl and 1 mM MgCl2) for 15 min at
25 °C. The reactions were irradiated at 254 nm for 20 min on ice at a 3-cm distance 
from a UVG-54 mineral light lamp (UVP). Immediately after crosslinking, the 
reactions were terminated by the addition of SDS sample buffer (40% glycerol, 8% 
SDS, 2% 2-mercaptoethanol, 40mM EDTA, 0.05% bromophenol blue and 
250 mM Tris-HCl, pH 6.8), boiled for 5 min and then separated by SDS-PAGE. 
The gels were dried, exposed to a phosphor screen and visualized using a Typhoon 
Trio imager (GE Healthcare). 

Protein purification. The construct expressing a constitutively active form of 
WspR (pQE-WspR*)”** was a gift from S. Lory. The purification of His6-tagged
WspR was carried out as described previously, using Ni-NTA affinity chromato- 
graphy (Qiagen)**. DNA encoding the CTD of mouse STING (nucleotides 414- 
1,137) was cloned into the vector pET28a and purified by Ni-NTA affinity 
chromatography (Qiagen) according to the manufacturer’s instructions. 
Equilibrium dialysis. The binding affinity of radioactive c-di-GMP was measured 
by equilibrium dialysis, using a 96-well equilibrium dialyser (Harvard Apparatus) 
with a 5,000 molecular weight cut-off membrane. One chamber contained 150 µl
10 µM purified His6-tagged STING(138-378) in assay buffer (25 mM Tris-HCl,
pH 7.4, 100 mM NaCl, 1 mM MgCl2 and 10% glycerol), while the other was filled
with 150 µl c-di-[32P]GMP at a range of concentrations (40-160 µM). Equilibrium
was reached after 48 h at 25 °C, and three samples were drawn from each chamber 
and mixed with 2 ml Econo-Safe scintillation fluid. Samples were measured in an 
LS 6000 IC scintillation counter (Beckman). Data analysis was performed using 
Prism 5.0b software (GraphPad). The dissociation constant (Kd), the maximum
number of binding sites (Bmax) and the Hill coefficient (h) were generated using
nonlinear regression, allowing one-site specific binding with a Hill slope. 
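The binding model named above can be sketched as follows. The form used, Y = Bmax·X^h / (Kd^h + X^h), is the usual statement of one-site specific binding with a Hill slope; the sketch assumes this is the model that was fitted. It only evaluates the model and the sum-of-squares objective that nonlinear regression minimizes, and it is not a substitute for the Prism fit. All names and parameter values are hypothetical.

```python
def specific_binding_hill(x, bmax, kd, h):
    """One-site specific binding with Hill slope: Bmax * X^h / (Kd^h + X^h)."""
    return bmax * x ** h / (kd ** h + x ** h)


def sum_squared_residuals(points, bmax, kd, h):
    """Objective minimized by nonlinear regression for (ligand, bound) pairs."""
    return sum((y - specific_binding_hill(x, bmax, kd, h)) ** 2 for x, y in points)


# Hypothetical parameters and free-ligand concentrations (µM); not fitted values.
bmax, kd, h = 1.0, 5.0, 1.5
ligand_um = [0.5, 1.0, 2.0, 5.0, 10.0, 20.0]
simulated = [(x, specific_binding_hill(x, bmax, kd, h)) for x in ligand_um]

# At X = Kd the model gives half-maximal binding, whatever the Hill slope.
half_max = specific_binding_hill(kd, bmax, kd, h)
```

A Hill coefficient of 1 reduces the model to simple hyperbolic binding; values above or below 1 would indicate positive or negative cooperativity.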
Control of flowering and storage organ formation in
potato by FLOWERING LOCUS T 


Cristina Navarro, José A. Abelenda, Eduard Cruz-Oro, Carlos A. Cuéllar, Shojiro Tamaki, Javier Silva, Ko Shimamoto
Seasonal fluctuations in day length regulate important aspects of
plant development such as the flowering transition or, in potato 
(Solanum tuberosum), the formation of tubers. Day length is 
sensed by the leaves, which produce a mobile signal transported 
to the shoot apex or underground stems to induce a flowering 
transition or, respectively, a tuberization transition. Work in 
Arabidopsis, tomato and rice (Oryza sativa) identified the mobile 
FLOWERING LOCUS T (FT) protein as a main component of the 
long-range ‘florigen’, or flowering hormone, signal'*. Here we 
show that expression of the Hd3a gene, the FT orthologue in rice, 
induces strict short-day potato types* to tuberize in long days. 
Tuber induction is graft transmissible and the Hd3a-GFP protein 
is detected in the stolons of grafted plants, transport of the fusion 
protein thus correlating with tuber formation. We provide evid- 
ence showing that the potato floral and tuberization transitions are 
controlled by two different FT-like paralogues (StSP3D and 
StSP6A) that respond to independent environmental cues, and 
show that an autorelay mechanism involving CONSTANS modu- 
lates expression of the tuberization-control StSP6A gene. 

Potato, the third largest global food crop after wheat and rice, is 
cultivated for its underground storage stems or tubers, which are rich 
in starch and other nutrients. Short days and cool temperatures pro- 
mote tuber formation, ensuring that differentiation of these vegetative 
propagation organs precedes winter. Whereas cultivated potatoes 
derive from Chilean landraces more adapted to long-day conditions, 
Andean types such as the S. tuberosum group Andigena (Solanum 
tuberosum andigena) tuberize only in short days. These plants require 
night periods longer than a critical length, as a pulse of light during the 
night (a ‘night break’) represses tuberization, as seen in strict short-day 
flowering plants®”’. 

Day length is sensed by expanded leaves, which synthesize a mobile 
signal or ‘tuberigen’ that is transported to the underground stems to
induce tuber formation*. This long-distance signal shares several fea- 
tures with the mobile florigen, with different pieces of evidence sug- 
gesting that related photoperiodic pathways may control synthesis of 
both signals*. In Arabidopsis thaliana, activation of the FT gene by 
CONSTANS (CO) mediates floral transition in long days”'®. 
Transport of the FT protein from the leaf to the shoot apical meristem 
has been demonstrated in diverse ways''""’, it being broadly accepted 
that this phloem mobile protein functions as the florigen. Closely 
related genes also mediate control of flowering in rice, although in this 
short-day plant the CO orthologue Heading date 1'* (Hd1) activates 
expression of the FT-like Hd3a gene in short days but represses its 
expression under long-day conditions’*"*. 

A CO-dependent pathway is also thought to mediate short-day 
tuberization, as expression of Arabidopsis CO in Andigena plants 
delays tuber formation in short days’. High light irradiance, however, 
induces Andigena potatoes to flower in long days, although these 
conditions are restrictive for tuberization®. This long-day flowering
response seemed to argue against a role for FT in tuberization, implicating
an additional CO target as the mobile tuberigen. Here we show
that ectopic expression of the rice Hd3a gene induces Andigena plants 
to flower and tuberize under non-inductive long days, demonstrating 
the potential of FT to act as the mobile tuberigen. We show, in addi- 
tion, that flowering and short-day tuberization responses are regulated 
by two members of the potato FT-like gene family that respond to 
different environmental cues. 

To assess whether FT has a role in tuberization, we transformed 
Andigena plants with the rolC::Hd3a-GFP construct (GFP, green fluor-
escent protein), which in rice promotes floral transition in long days’’.
Lines expressing this construct were induced to flower (Fig. 1b, c) and 
were able to tuberize in non-inductive long days (Fig. 1a). When grafted 
to wild-type plants, these lines induced the wild-type controls to tuberize 
in long days, independently of whether they were used as donors (Hd3a 
onto wild type) or as stocks (wild type onto Hd3a), whereas none of the 
control grafts (wild type onto wild type; Fig. 1d) tuberized in non- 
inductive long days. We detected the Hd3a-GFP protein but not its
transcript in the stolons of grafted wild-type stocks (Supplementary Fig.
1a, c), demonstrating that the protein but not the RNA can move across
the graft junction and function as a powerful tuberization inducer. 

Studies in tomato identified six members of the SELF-PRUNING 
(SP) gene family’’ and sequencing of two diploid potato (Tuberosum 
RH89-039-16 and Phureja DM1-3 516 R44) genomes recently led to 
the identification of three additional FT and TFL family members”. 
Phylogeny of these genes grouped the StSP6A, StSP5G, StSP5G-like 
and StSP3D homologues into the same clade as the Arabidopsis FT, 
tomato SINGLE-FLOWER TRUSS (SFT) and rice Hd3a genes 
(Supplementary Fig. 2). Quantitative PCR with reverse transcription 
(RT-PCR) revealed that these transcripts are expressed in leaves or 
stolons, whereas transcripts for SP/TFL1 and MFT members are more 
ubiquitously distributed (Fig. 1e). Interestingly, StSP6A gene expres-
sion strongly correlates with tuberization, high levels of expression
being observed in leaves and stolons of short-day-induced plants 
and in antisense phytochrome B lines, with constitutive tuberization”' 
(Fig. 1f and Supplementary Information, section 1). This expression 
profile suggests that this gene is involved in tuberization control, 
StSP6A overexpression (StSP6Aox) lines being able to tuberize under 
non-inductive long days (Fig. 2a, b) and induced to flower, although 
their flowering phenotype is less severe than in Hd3a lines (Sup- 
plementary Fig. 3a, b and Supplementary Information, section 2). 
StSP6A silencing, in turn, strongly delays tuber formation in short 
days (Fig. 2c, d), pointing to an essential role for this FT-like protein 
in tuberization promotion. StSP6A expression analyses in commercial 
cultivars with early (Jaerla), late (Baraka) and intermediate (Kennebec) 
tuberization periods, in addition, show that accumulation of this tran- 
script in leaves correlates with the tuberization time of these cultivars 
(Supplementary Fig. 6), indicating that this FT paralogue is involved in 
tuberization control even in non-photoperiodic cultivars. 
Figure 1 | Phenotype of Andigena rolC::Hd3a-GFP lines and expression
profiles of the potato FT- and TFL1-like genes. a, Hd3a lines (centre and
right) tuberize under long-day conditions. Left, wild-type (WT) control. 

b, c, Early-flowering Hd3a plants (c) relative to wild type (b). d, Tuber 
induction in long days of wild-type plants grafted with Hd3a donors (right) or 
stocks (centre). Wild-type control grafts do not tuberize (left). Red lines 
indicate the graft junction. e, Relative levels of expression of the potato FT-
and TFL1-like genes. 11707, PGSC0003DMG400011707; 16180,
PGSC0003DMG400016180; 07111, PGSC0003DMG400007111; 14322, 
PGSC0003DMG400014322; Veg., vegetative apex; Inf., inflorescence apex; NB, 
night break; SD, short day. f, Semiquantitative RT-PCR analysis of StSP6A 
expression in leaves and stolons of plants with different tuberization states 
(indicated on top). The pictures show non-induced stolons or tubers. LD, 
long day. 


Day-neutral tomato flowering is regulated by the FT-homologue 
SFT/SP3D gene’. This gene has been reported to be regulated inde- 
pendently of CO and day length’’”’, its potato orthologue being likely 
to have a role in light-irradiance-dependent flowering. StSP3D- 
silenced lines were actually found to be late flowering (Fig. 2e-h), 
although under short-day conditions they tuberized at the same time 
as untransformed controls (Supplementary Fig. 7 and Supplementary 
Information, section 2), indicating that this gene is important in 
flowering but not in the tuberization transition. Remarkably, in modern 
tomato cultivars SP6A was reported to have an extra nucleotide (T at 
position 421) that leads to a premature stop codon, implying a non- 
functional role for this gene”. StSP3D, in turn, retains a weak short-day 
activation response in Andigena, although transcript levels are lower 
than those of StSP6A (Fig. 1e). Together, these observations suggest that
members of the FT-like family in Solanaceae diversified such that SFT/ 
SP3D ceased to be regulated by CO”*”’ and became responsive to other 
environmental cues, such as to be important in day-neutral flowering 
control. SP6A function was lost in modern tomato cultivars but in 
potato plays a major part in tuberization, as day-length-dependent 
activation of this gene mediates strict short-day tuberization of 
Andigena species. Thus, the roles of these two FT paralogues in 
flowering and tuberization control have evolved, at least in part, 
through changes in their expression profiles, both genes encoding 
functionally similar proteins, as indicated by the finding that
StSP6A expression in Arabidopsis rescues the late-flowering pheno- 
type of co-1 and ft-1 mutants (Supplementary Fig. 8). 
Figure 2 | Function of the StSP6A and StSP3D genes in short-day-
dependent tuberization and day-neutral flowering control. a, b, StSP6A 
expression in leaves of StSP6Aox lines (a) and tuber induction in these lines 
(b) in long days. c, d, StSP6A silencing in leaves of StSP6A RNAi (SP6Ai) lines 
(c) correlates with late tuberization in short days (d). e, Relative StSP3D 
expression in StSP3D RNAi (SP3Di) lines. Error bars, s.d. (n > 3). f, Late 
flowering of StSP3D-silenced lines. g, h, Scanning electron microscopy images 
of the StSP3D RNAi apexes: vegetative apex in L26 (g); apex at inflorescence 
transition in L32 (h). Scale bars, 200 µm.


Interestingly, two other FT-clade members, StSP5G and StSP5G-like 
(Supplementary Table 2), are expressed under non-inductive long-day 
conditions (Fig. 1e). This expression profile suggests an antagonistic
function for StSP6A and these two FT paralogues (Supplementary 
Information, section 3), as recently reported for the sugar beet (Beta 
vulgaris) BvFT1 and BvFT2 genes™. Further studies will be required to 
assess whether these FT-clade members have an inhibitory effect on 
tuberization. 

StSP6A is expressed not only in leaves but also in stolons of tuberiz- 
ing plants (Fig. 1f). Expression in these organs is delayed with respect 
to the leaves, suggesting an autoregulatory loop for the transported 
protein. In support of this relay mechanism, increased levels of 
expression of the endogenous StSP6A transcript are observed in 
Hd3a lines (Supplementary Fig. 10) and wild-type stocks grafted to 
these plants (Supplementary Fig. 10a). Thus, in contrast to the 
Arabidopsis FT or rice Hd3a gene, StSP6A is regulated by a relay 
mechanism that sustains synthesis of the inducing signal in stolons. 
In this regard, a local balance between the SP floral repressor and the 
tomato SFT florigen signal has been shown to contribute to the differ- 
ential flowering response of primary and secondary shoot meristems”. 
Hence, it is possible that having more SP in stolons confers a reduced 
sensitivity to FT, which may explain why floral transition is activated 
without tuber formation, whereas Hd3a overexpression activates both 
developmental processes. 

To rule out that tuberization of grafted stock plants is mostly 
mediated by this relay mechanism, we grafted Hd3a scions onto
StSP6A RNA interference (RNAi) stocks to test whether inhibition 
of StSP6A expression blocks Hd3a signalling. As shown in Fig. 3a, 
activation of the endogenous StSP6A gene is strongly reduced in 
StSP6A RNAi compared with the wild-type stocks. Tuberization onset 
is delayed and tuber yield reduced in RNAi lines relative to the Hd3a/ 
wild-type grafted controls (Fig. 3b), owing to impaired signal amp- 
lification, but this does not preclude tuberization of the RNAi stocks, 
highlighting induction by the Hd3a protein. 

Finally, we tested whether this regulatory relay requires CO func-
tion, by grafting StSP6Aox plants onto StCOox stocks and analysing
endogenous StSP6A gene expression (Fig. 3c-e). Like the rice Hd1
protein’*’*, StCO represses StSP6A gene expression in long days, and 
Figure 3 | Autoregulatory loop for StSP6A expression. a, Levels of StSP6A
transcript in transgenic (left) and grafted (right) plants. Hd3a scions were 
grafted onto wild-type or StSP6A RNAi (SP6Ai) lines. b, Tuber induction in 
grafted plants, scored when 100% of the Hd3a/wild-type grafts were tuberizing. 
c, Relative expression of the endogenous StSP6A transcript in COox and 
SP6Aox plants (left), and in grafts with wild-type and COox stocks (right). 

d, Tuber induction of the grafted plants grown in non-inductive long days. 
Error bars, s.d. of two experimental replicates (n > 8). See Supplementary Fig. 
12 for each replicate result. e, StSP6A autoregulation. 


transfer to short-day conditions relieves this repression (Supplemen- 
tary Information, section 4). In line with this function, activation of the 
StSP6A gene is largely repressed in stolons of the StSP6Aox/StCOox 
grafted plants (Fig. 3c), relative to StSP6Aox/wild-type grafts used as 
controls, which implies that StCO is involved in the autoregulatory 
loop that drives StSP6A expression in stolons. Moreover, StSP6A accu- 
mulates only to basal levels in the stolons of StCOox/wild-type and 
wild-type/wild-type grafts used as negative controls (StSP6A is not 
expressed in long days in StCOox or wild-type scions), which implies 
that expression in these organs requires the mobile protein produced in 
the leaves. 

Our data provide the molecular basis for a long-standing physio- 
logical observation, namely that flowering tobacco plants grafted onto 
non-induced potato stocks induce tuberization of the stock plants, 
irrespective of the photoperiodic requirement of the donor plant’®. 
An additional issue is how flowering and tuberization transitions are 
differentially triggered in response to the mobile FT signal. In 
Arabidopsis, FT interacts with FD to activate expression of floral 
meristem identity genes””**. We observed that Hd3a stolons initiate 
floral buds from the apical meristem (Fig. 4a-c) at the same time as
they differentiate tubers from the subapical region. Using transgenic 
lines expressing the StSP6A protein under control of an ethanol- 
inducible promoter, tuber-specific transcripts were observed within 4h 
of induction in the stolons (Fig. 4d, e and Supplementary Information, 
Figure 4 | Floral phenotype of Hd3a stolons and proposed model for flower
and tuber induction. a-c, Flower induction in Hd3a stolons. Scanning 
electron microscope images of wild-type (a) and Hd3a (b, c) stolon apical 
meristems. Scale bars, 100 µm. d, Local StSP6A induction activates StGA2ox1
gene expression in the stolon. e, Activation of other genes reported to be 
induced early during tuber development. Colour bars represent fold change 
(FC) in gene expression in Alc-StSP6A stolons relative to Alc-uiD controls 
(black bars). Alc, ethanol-inducible promoter; error bars, s.d. of two 
experimental replicates (n > 10). f, Model for regulation of flowering and 
tuberization transitions. See Supplementary Information, section 6, for a 
detailed explanation. 


section 5). This very rapid response precludes transport of any mobile 
signal from the leaves and thus supports the notion that the StSP6A 
protein is the mobile tuberigen transported to below-ground organs. 
Thus, it is possible that StSP6A interacts in stolon subapical cells with 
an as yet unknown transcriptional regulator, or that FD-StSP6A dif- 
ferentially activates a set of target genes specific to these cells, such as 
StGA2ox1 (ref. 29; Fig. 4d). A challenge for the future is to identify such 
stolon-specific StSP6A target genes and/or StSP6A-interacting partners, 
to establish how formation of these storage organs is regulated. 

Identification of FT as a switch for tuberization supports the notion 
that FT function extends beyond flowering induction (Fig. 4f). Other 
studies have implicated FT in seasonal control of growth cessation in 
poplar trees*° and meristem growth termination in tomato”. Thus, FT 
is emerging as a key mobile signal controlling not only flowering but 
also a number of other meristem-associated transitions. 


METHODS SUMMARY 

Plant materials and growth conditions. Andigena 7540 wild-type plants and 
antisense phytochrome B*! and AtCOox (ref. 17) lines were grown in a greenhouse 
under long-day conditions. Growth chambers were used for controlled short-day 
(8h light, 16 h dark) and short-day/night-break (short day plus a 30-min pulse of 
light in the middle of the night) treatments. Transgenic Andigena plants were 
generated by Agrobacterium-mediated transformation of leaf explants. Plants 
were grafted as previously described’’ and cultivated under long-day conditions 
to analyse their tuberization response. 

Plasmid constructs. StSP6A was cloned into the pBinAR binary vector to generate 
StSP6A-overexpressing lines. StSP6A, StSP3D and StCO RNAi constructs were 
generated by inserting the 3’ non-conserved regions of these genes in opposite
orientations into the pBIN19RNAi vector. The pGWB2 vector was used for StCO 
overexpression. StSP6A ethanol-inducible lines were generated by transformation 
with the pBinSRNA-GW plasmid containing the StSP6A coding region. Primer 
sets used for these constructs are listed in Supplementary Table 3. 

Real-time RT-PCR analyses. First-strand complementary DNA was synthesized 
from 2 µg total RNA and 1 µl of the reaction was used for real-time gene expression
analysis with the SYBR Green PCR master mix (Applied Biosystems). The actin 8 
gene was used for normalization. Identical procedures were used for semiquantita- 
tive RT-PCR, except that amplification was conducted in a Peltier thermal cycler 
(PTC-200, MJ Research). For sets of specific primers and product lengths, see 
Supplementary Table 3. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Plant materials and growth conditions. Andigena 7540 wild-type lines and the 
antisense phytochrome B”' (phyB) and AtCO overexpresser’” (AtCOox) lines were 
grown in the greenhouse under long-day conditions. Growth chambers were used 
for controlled short-day (8h light, 16h dark), short-day/night-break (short day 
plus a 30-min pulse of light given 8 h after the beginning of the dark period) and 
long-day (16h light/8 h dark) treatments. 

Transgenic Andigena plants carrying the rolC::Hd3a-GFP"*, StSP6A over- 
expression, StSP6A RNAi, StSP3D RNAi, StCOox and StCO RNAi constructs were 
generated by Agrobacterium-mediated transformation of leaf explants as 
described previously’. Different graft combinations were obtained as previously 
described’’ and their tuberization phenotype was scored under long-day condi- 
tions. Tuber formation in StSP6A RNAi and StSP3D RNAi lines was analysed by 
growing the plants under long-day greenhouse conditions until a ten-leaf stage, 
before transferring them to short-day conditions. For flowering studies, plants 
were grown under high-irradiance (>200 µE s−1 m−2) long-day conditions. At
least nine replicates of each line were used for these studies. 

Arabidopsis constans-1 (co-1) and flowering locus t-1 (ft-1) mutants in the
Columbia (Col-0) and Landsberg erecta (Ler) backgrounds were transformed
using the floral dip method. Flowering time was measured as the number of rosette 
leaves at floral initiation under long-day conditions, in at least ten individuals. 
Plasmid constructs. The rolC::Hd3a-GFP construct has been described else- 
where'*. Transcripts corresponding to the StSP6A, StSP3D and StCO genes were 
amplified using primers designed on the tomato sequences (AY186737, AY186735
and AY490253, respectively). The StSP6A overexpression construct was generated 
by amplifying the protein-coding region and then inserting the PCR product into 
the SmaI site of the pBinAR binary vector, between the 35S promoter and the ocs
terminator. To silence the StSP6A, StSP3D and StCO transcripts, Andigena plants 
were transformed with RNAi constructs designed on the non-conserved region of 
the genes. The amplification products were cloned into the pENTR/D-TOPO 
plasmid (Invitrogen) and inserted in opposite orientations by recombination with 
the LR Clonase II enzyme (Gateway Technology, Invitrogen) into the pBIN19RNAi 
destination vector. The pBIN19RNAi interference vector was generated by partial 
XbaI/HindIII digestion of the pH7GWIWG2(II) plasmid (http://gateway.psb.ugent.be/)
and insertion of the Gateway cassette fragment into the same restriction
sites of the pBIN19 binary vector. The StCO overexpression construct was
generated by amplifying and cloning the protein-coding region into the pENTR/D- 
TOPO plasmid (Invitrogen), and further insertion by recombination into the 
pGWB2 destination vector*’. To generate the StSP6A ethanol-inducible lines, the 
pBinSRNA-GW destination vector was created by insertion of the blunt-ended 
pAlcA-R1-R2-t35S cassette from the AlcAP-GW pGreen vector (gift of Patrick 
Laufs, INRA), generated by XbaI digestion, into the blunted HindIII sites of the
binary vector pBinSRNACatN. The StSP6A coding region was then introduced in 
this destination vector by LR Clonase recombination. A list of the primer sets used 
to generate these constructs is shown in Supplementary Table 3. 
Real-time and semiquantitative RT-PCR analyses. Expression of potato FT- 
and TFL1-like family members was quantified by real-time PCR. First-strand
complementary DNA was synthesized from 2 µg total RNA and 1 µl of the reaction used
for real-time gene expression analysis with the SYBR Green PCR master mix 
(Applied Biosystems). Quantitative PCR was performed using the Power SYBR 
Green PCR Master Mix (Applied Biosystems) on an ABI7500 Real-Time PCR 
System (Applied Biosystems), following the manufacturer’s instructions. Primer 
pairs were specifically designed for each gene using PRIMER EXPRESS 3.0 software 
(Applied Biosystems) and probed for high-efficiency amplification under standard 
quantitative PCR conditions. All reactions were carried out in at least two
independent biological replicates. The Pfaffl** method was used for relative
quantification of gene expression. Direct $2^{-\Delta C_T}$, where
$\Delta C_T = C_T(\text{target gene}) - C_T(\text{actin})$
(ref. 33), was used to generate the tissue-specific heat map in Fig. 1e. The
comparative critical threshold ($2^{-\Delta\Delta C_T}$) method*! was used to analyse relative Hd3a and
StSP6A expression levels (Supplementary Figs 1 and 10).
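
For illustration only, the two relative-quantification formulas named above can be written as a short Python sketch. This is not the authors' analysis code; the function names, argument names and the choice of a calibrator sample are assumptions made for the example, and the Pfaffl efficiency correction is not included.

```python
def delta_ct(ct_target: float, ct_reference: float) -> float:
    """Delta-CT: target-gene CT minus reference-gene (e.g. actin) CT."""
    return ct_target - ct_reference


def relative_expression(ct_target: float, ct_reference: float) -> float:
    """Direct 2^(-delta CT) expression relative to the reference gene."""
    return 2.0 ** (-delta_ct(ct_target, ct_reference))


def fold_change_ddct(ct_target_sample: float, ct_ref_sample: float,
                     ct_target_calibrator: float, ct_ref_calibrator: float) -> float:
    """Comparative 2^(-delta delta CT) fold change of a sample versus a calibrator.

    Assumes (hypothetically) ~100% amplification efficiency for both genes.
    """
    ddct = (delta_ct(ct_target_sample, ct_ref_sample)
            - delta_ct(ct_target_calibrator, ct_ref_calibrator))
    return 2.0 ** (-ddct)
```

A target gene that crosses threshold two cycles later than the reference gives a relative expression of 0.25 under these assumptions.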

Identical procedures were used for semiquantitative RT-PCR, except that amp- 
lification was conducted in a Peltier thermal cycler (PTC-200, MJ Research) and 
one-tenth of the complementary DNA reaction was used for actin amplification. 
For sets of specific primers and product lengths, see Supplementary Table 3. 
Microarray sample hybridization and analysis. Stolons of wild-type plants were 
collected in long days and two, six and eight days after transferring the plants to 
short days. Time-course profiling analyses of short-day-induced stolons were
performed as described previously”. 

Arrays of StSP6A overexpression versus wild type (sampled in long days) and 
StSP6A RNAi stolons versus wild type (sampled in short days, day 6) were per- 
formed with samples from three independent lines. Samples were hybridized 
against POCI (Potato Oligo Chip Initiative) microarrays, and background correction
and normalization were performed using LIMMA**”. The data obtained were
statistically checked (false discovery rate, <0.05) and genes with a log ratio change
of +2 or −2 for StSP6A RNAi and +1.8 or −1.8 for StSP6Aox plants were
selected. The hierarchical cluster of genes present in both experiments was calcu- 
lated, compared with the short-day profile and represented using the TIGR MEV 
free software (http://www.tm4.org/mev/). Genes found to be differentially 
expressed are listed in Supplementary Table 1. 
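
The selection rule described above can be sketched in Python as follows. This is a hypothetical illustration, not the LIMMA workflow itself: the record type, field names and example values are invented, and treating the stated log-ratio values as inclusive cut-offs on the absolute log ratio is an assumption.

```python
from typing import List, NamedTuple


class GeneResult(NamedTuple):
    gene_id: str      # hypothetical identifier
    log_ratio: float  # log ratio versus wild type
    fdr: float        # false discovery rate


def select_genes(results: List[GeneResult],
                 min_abs_log_ratio: float,
                 max_fdr: float = 0.05) -> List[GeneResult]:
    """Keep genes with FDR below max_fdr and |log ratio| at or above the cut-off."""
    return [r for r in results
            if r.fdr < max_fdr and abs(r.log_ratio) >= min_abs_log_ratio]


# Invented example values, for illustration only.
example = [
    GeneResult("geneA", 2.4, 0.01),
    GeneResult("geneB", -1.9, 0.02),
    GeneResult("geneC", 3.0, 0.20),
]
rnai_selected = select_genes(example, min_abs_log_ratio=2.0)  # RNAi cut-off
ox_selected = select_genes(example, min_abs_log_ratio=1.8)    # overexpression cut-off
```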

Immunoblot analysis. Total protein extracts from stolon tissues were obtained in 
lysis buffer (50 mM Tris-HCl, pH 7.5, 150 mM NaCl, 1% Na deoxycholate, 0.5%
Triton X-100, 1 mM PMSF and protease inhibitors). For analysis of protein graft
transmissibility, total extracts were incubated overnight with 10 µl of an anti-GFP
affinity matrix (MBL). After extensive washing, unbound and bound proteins were 
separated by 10% SDS-polyacrylamide gel electrophoresis, blotted onto nitrocellulose 
and probed with an anti-GFP antibody (Roche). Immunoreactive proteins were 
detected with the SuperSignal West Pico Chemiluminescence kit (Pierce). 
Microscopy. GFP fluorescence was observed on longitudinal stolon sections 
(150 µm) obtained with a vibratome (PELCO 101). Fluorescence was excited with
a 488-nm argon laser and emission images were collected in the 500-600-nm 
range using a Leica TCS SP5 spectral confocal microscope. 

For scanning electron microscopy, potato stolons and apical meristems were
frozen in an Oxford CT 1500 cryosystem (Oxford Instruments), sublimated under
vacuum and observed using a JEOL JSM 5410 electron microscope operating at
10 kV.
Phylogenetic analysis. Full-length protein sequences, except sequences of the 
potato FT- and TFL1-like family members, were obtained from GenBank. 
Sequences of potato FT- and TFL1-like family members were obtained from the 
Potato Genome Sequencing Consortium Data Release”® (http://potatogenomics. 
plantbiology.msu.edu/) by TBLASTX using as a query previously described tomato 
sequences’” (SlSP, no. U84140; SlSP6A, no. AY186737; SlSP3D, no. AY186735;
SlSP9D, no. AY186738; SlSP2I, no. AY186734; and SlSP5G, no. AY186736). The
best genome matches (Supplementary Table 2) were downloaded and open reading
frames were predicted using FGENESH (http://linux1.softberry.com/berry.phtml)
and ORF Finder (http://www.ncbi.nlm.nih.gov/projects/gorf/). A counterpart to
SlSP2I was not found in the potato genome. Phylogenetic analyses were conducted
using MEGA4**. Sequences were aligned with the COBALT program (http:// 
www.ncbi.nlm.nih.gov/tools/cobalt/) and their evolutionary relationship inferred 
using the neighbour-joining method”. All ambiguous positions were removed for 
each sequence pair. The evolutionary distances were computed using the Poisson 
correction method and the bootstrap consensus tree was inferred from 1,000 repli- 
cates. The accession numbers for the corresponding genes are indicated in the tree 
(Supplementary Fig. 2). 
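
As a rough illustration of the distance measure mentioned above, the Poisson-corrected distance between two aligned protein sequences is d = −ln(1 − p), where p is the proportion of differing sites. The Python sketch below is not MEGA4 code; the function name, the set of characters treated as ambiguous and the example sequences are assumptions, and positions are skipped pair by pair to mirror the per-pair removal of ambiguous positions.

```python
import math


def poisson_distance(seq_a: str, seq_b: str, ambiguous: str = "-X?") -> float:
    """Poisson-corrected distance d = -ln(1 - p) for two aligned sequences.

    Sites where either sequence has a gap or ambiguous character are
    ignored for this pair only (pairwise deletion).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a, seq_b):
        if a in ambiguous or b in ambiguous:
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = differing / compared
    if p >= 1.0:
        raise ValueError("distance undefined when every site differs")
    return -math.log(1.0 - p)
```

For the invented pair "MKTA-L" and "MRTAGL", five sites are compared and one differs, so p = 0.2 and d is about 0.223.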
Structure of human mitochondrial RNA polymerase


Rieke Ringel, Marina Sologub, Yaroslav I. Morozov, Dmitry Litonin, Patrick Cramer & Dmitry Temiakov


Transcription of the mitochondrial genome is performed by a 
single-subunit RNA polymerase (mtRNAP) that is distantly related 
to the RNAP of bacteriophage T7, the pol I family of DNA polymerases,
and single-subunit RNAPs from chloroplasts’ *. Whereas
T7 RNAP can initiate transcription by itself, mtRNAP requires the 
factors TFAM and TFB2M for binding and melting promoter 
DNA*”. TFAM is an abundant protein that binds and bends pro- 
moter DNA 15-40 base pairs upstream of the transcription start 
site, and stimulates the recruitment of mtRNAP and TFB2M to 
the promoter*’. TFB2M assists mtRNAP in promoter melting 
and reaches the active site of mtRNAP to interact with the first base 
pair of the RNA-DNA hybrid”. Here we report the X-ray structure 
of human mtRNAP at 2.5 Å resolution, which reveals a T7-like
catalytic carboxy-terminal domain, an amino-terminal domain that 
remotely resembles the T7 promoter-binding domain, a novel 
pentatricopeptide repeat domain, and a flexible N-terminal exten- 
sion. The pentatricopeptide repeat domain sequesters an AT-rich 
recognition loop, which binds promoter DNA in T7 RNAP, 
probably explaining the need for TFAM during promoter binding. 
Consistent with this, substitution of a conserved arginine residue
in the AT-rich recognition loop, or release of this loop by deletion of
the N-terminal part of mtRNAP, had no effect on transcription. The 
fingers domain and the intercalating hairpin, which melts DNA in 
phage RNAPs, are repositioned, explaining the need for TFB2M 
during promoter melting. Our results provide a new venue for the 
mechanistic analysis of mitochondrial transcription. They also indi- 
cate how an early phage-like mtRNAP lost functions in promoter 
binding and melting, which were provided by initiation factors in 
trans during evolution, to enable mitochondrial gene regulation 
and the adaptation of mitochondrial function to changes in the 
environment. 

We crystallized a fully functional variant of a recombinant human 
mtRNAP (residues 105-1230) that requires the presence of both 
TFAM and TFB2M for efficient transcription initiation on a double- 
stranded promoter DNA (Supplementary Fig. 1). The structure was 
determined at 2.5 Å resolution by molecular replacement with the use
of a truncated T7 RNAP structure as a search model’, and was refined 
to a free R-factor of 0.23 (Methods, and Supplementary Table 1). 

The mtRNAP structure has the shape of a right hand with palm, 
fingers and thumb subdomains, characteristic of the pol A family of 
Figure 1 | Crystal structure of human mtRNAP. a, Two views of a ribbon
model with the major domains and structural elements indicated. The CTD 
that is conserved in all single-subunit RNAPs is in dark grey, the NTD in silver,
the PPR domain in blue, and the N-terminal extension helix in sand. The active 
site is indicated by a magenta sphere for a modelled catalytic metal ion. 

b, Schematic comparison of mtRNAP with T7 (PDB 1QLN) RNAP. Prominent 
structural elements are indicated. mtRNAP-specific residues 1-368 include the 
mitochondrial targeting signal, the N-terminal extension and the PPR domain. 


Regions in mtRNAP that are not visible in the crystal structure include residues 
592-602, 736-769 and 1086-1105. c, Superimposition of the active-centre 
regions of mtRNAP (grey) and the T7 RNAP initiation complex (PDB 1QLN, 
light pink) reveals a clenched conformation due to a roughly 25° rotation of the 
fingers subdomain around the O-helix axis. Template DNA is in blue, RNA in 
red. Conserved catalytic residues, residues implicated in NTP binding, and a 
sulphate ion are shown with sticks. 
nucleotidyltransferases (Fig. 1). We use here the canonical polymerase
domain nomenclature and the previously defined names for functional 
elements in T7 RNAP*"’, even though these have functional implica- 
tions that are not borne out in mtRNAP. The highly conserved palm in 
the C-terminal domain (CTD) superimposes well on the corresponding
palm in T7 RNAP (root mean squared deviation 1.0 Å, 121 backbone
atoms). The marked conservation of the active centre, including
the O helix, suggests that the mechanisms of substrate binding and
selection are conserved between mtRNAP and T7 RNAP!*”*. The
O helix binds a sulphate ion with its residues R987, K991 and K853 
(Fig. 1c) near the position of the phosphate groups of an incoming 
nucleoside triphosphate (NTP) in the T7 RNAP elongation com- 
plex'*’*. A portion of the thumb (residues 736-769) and the flexible 
specificity loop (residues 1086-1105) are not visible in the structure, 
but adjacent elements occupy almost identical positions in T7 RNAP, 
indicating that these elements have similar orientations and functions. 


The most notable difference between the CTD of T7 RNAP and 
mtRNAP is the position of the fingers subdomain, which is rotated as a 
rigid body by roughly 25° approximately around the axis of the O helix, 
accompanied by a tilting of the O and Y helices (Fig. 1c). These movements
result in a 15 Å translation of the N-terminal part of the Y helix
towards the promoter-binding domain (PBD). The observed position 
of the fingers domain is distinct from the previously observed positions 
in the ‘closed’ elongation-complex structure of T7 RNAP, in which the 
O helix is positioned to deliver the NTP into the active site’* (Fig. 1c). 
We refer to this novel RNAP conformation as ‘clenched’. The clenched 
conformation is unlikely to occur during the nucleotide addition cycle, 
because the entrance to the active site may be occluded by the con- 
served Y-helix residue Y1004, which was implicated in downstream 
duplex melting’*"* and is partly mobile in our structure. It is unclear 
whether the clenched conformation is functional or is a result of crystal
packing. 
Figure 2 | Variation in RNAP N-terminal domains. a, Sequence alignment
and structural conservation of human mtRNAP and T7 RNAP (PDB 1QLN). 
Secondary-structure elements are indicated above and below the sequences for 
mtRNAP (residues 368-654) and T7 RNAP, respectively (cylinders, α-helices;
arrows, β-strands; lines, loops). The PBD is in pink, the C helix in yellow, and
subdomain H in green. The AT-rich recognition loop (orange) and the 
intercalating hairpin are indicated. Identical residues are highlighted in red. 
Dashed lines indicate regions that are not visible in the mtRNAP structure. 
b–d, Comparison of the NTDs of mtRNAP (b) with initiation complexes of T7
RNAP (PDB 1QLN) (c) and N4 RNAP (PDB 3Q24) (d). Structures are aligned 
with respect to their conserved palm subdomains. Colour code and residue 
borders are as in a. RNA is in red, template DNA in blue, and a non-template 
strand in cyan. Of the CTD (grey or light pink) only the palm subdomain 
(residues 790-831, 911-937 and 1125-1178) and the thumb (residues 677- 
790) are shown. 
The large N-terminal region of mtRNAP (residues 1-647; Fig. 1b)
shares no sequence homology with T7 RNAP. Nevertheless, the 
C-terminal part of this region (residues 369-647) is structurally similar 
to the N-terminal domain of phage RNAPs, and we therefore refer to it 
as the N-terminal domain (NTD). In particular, the six-helix bundle of 
the PBD in phage RNAPs, which includes helices D, E, F, G, I and J, has
a counterpart in mtRNAP (Fig. 2). However, there are tangible changes 
in the orientation of the PBD helices between mtRNAP and T7 RNAP 
(root mean squared deviation 9 Å; 109 backbone atoms). In comparison
with T7 RNAP, the N-terminal part of the PBD (residues
425-519) is rotated by 32°, whereas the C-terminal part (residues 
567-654) is rotated by 24° in mtRNAP. 

The structure further reveals the loops corresponding to the inter- 
calating hairpin and the AT-rich recognition loop of T7 RNAP. The 
hairpin connects the conserved helices I and J and has a 14-residue 
insertion that is partly visible in the mtRNAP structure (Fig. 2a). The 
loop that connects helices D and E of the PBD corresponds to the AT- 
rich recognition loop but is shorter, whereas the D helix is extended by 
comparison with T7 RNAP (Fig. 2b, c). Another significant difference 
in the NTD of mtRNAP is the alternative fold of the region at the 
position of subdomain H in T7 RNAP. It is likely that this region of the 
mtRNAP NTD undergoes refolding during the transition from ini- 
tiation to elongation, to form a part of an RNA exit pore, as described 
for T7 RNAP’*"*"*. It remains to be seen whether the observed con- 
formation is specific for mtRNAP or whether it also occurs in T7 
RNAP but has not yet been observed. In phage N4 RNAP, which is 
distantly related to T7 RNAP, subdomain H is absent (Fig. 2d). 

The NTD of mtRNAP is connected to a unique helical domain 
(residues 218-368) through an extended, nearly linear, proline-rich 
linker (Figs 1 and 3). This domain consists of nine α-helices, of which
four comprise two pentatricopeptide repeat (PPR) motifs found in 
plant and mitochondrial proteins'®"*. We refer to this domain as 
the PPR domain. PPR-motif-containing proteins are apparently 
involved in RNA editing and processing’’”°, but their structures are 
unknown. Our structure of the PPR domain shows that the conserved 
residues of the PPR motif form the hydrophobic core of a helix–turn–helix
fold and the interface between antiparallel α-helices (αD′ and
αE′, αE′ and αF′, αF′ and αG′) and parallel helices (αE′ and αG′)
(Fig. 3a, b). The PPR domain interacts tightly with the NTD. Helices
αA′ and αB′ form a large hydrophobic interface (about 950 Å²) that
includes the tip of the AT-rich recognition loop and its flanking helices
αE and αD (Fig. 3c). In addition, a salt bridge between R225 and E574
and numerous hydrogen bonds stabilize the PPR-NTD interface 
(Fig. 3c). 

The region located N-terminal of the PPR domain, called here the 
N-terminal extension, is mostly mobile, except for a single unassigned 
α-helix that docks to a hydrophobic pocket formed by αF′ and αG′ of
the PPR domain (Fig. 3b). The N-terminal extension of human 
mtRNAP is required for function, because deletion of the N-terminal 
200 residues results in a truncated polymerase that is catalytically active 
but unable to initiate promoter-directed transcription (Supplementary 
Fig. 1). 

Footprinting experiments suggested that during transcription ini- 
tiation mtRNAP (in complex with TFB2M) and T7 RNAP occupy 14 
and 17 base pairs, respectively, of upstream promoter DNA”"® (Fig. 4a). 
The rest of the upstream promoter region in mammalian mtDNA (−35
to −15) is covered by TFAM’, suggesting that mtRNAP and T7 RNAP
use different modes of promoter binding. To explore this we modelled 
the putative trajectory of upstream promoter DNA in mtRNAP by 
superimposing the palm subdomains of mtRNAP and the T7 RNAP 
initiation complex” (Fig. 4b). In the resulting model, which remains 
tentative because DNA binding can cause structural changes, upstream 
promoter DNA runs along the NTD, and the template single strand 
descends into the mtRNAP active site (Figs 1c and 4b). The tip of the 
specificity loop may reach into the major groove of promoter DNA and 
read the sequence between −6 and −11 base pairs while its base may
interact with the phosphodiester backbone of a single-stranded DNA, 
essentially as in phage RNAPs*'. This is consistent with a functional 
analysis of point mutations in the specificity loop of yeast mtRNAP”. 

Modelling also reveals structural differences in mtRNAP, relative 
to T7 RNAP, that may render mtRNAP reliant on the initiation factor 
TFB2M and responsive to activation by TFAM. In particular, the 
AT-rich recognition loop has distinct structures and functions in the 
two systems. In comparison with T7 RNAP, helix D is rotated away 
from the minor groove of DNA by 45° and the tip of the AT-rich
recognition loop is shifted by 22 Å (Fig. 4c) and sequestered by
Figure 3 | PPR domain. a, Sequence alignment of human mtRNAP PPR
domain, including PPR motifs 1 and 2, with predicted PPR motifs in mtRNAPs 
of different species (lines 2–4), and chloroplast-targeting proteins of plants
(lines 5, 6). Abbreviations: H.s., Homo sapiens; M.m., Mus musculus; X.l.,
Xenopus laevis; D.m., Drosophila melanogaster; Z.m., Zea mays; A.t., 
Arabidopsis thaliana. Residues that are identical, highly conserved, conserved, 
and distinct, are highlighted in red, orange, yellow, and green, respectively. 
b, Structure of PPR domain (blue) with PPR motifs 1 (light cyan) and 2 (light
blue). Each PPR motif shows a helix–turn–helix fold. Helices αA′ and αB′ are
adjacent to the NTD (silver) and interact with the AT-rich recognition loop 
(orange). PPR motif 1 interacts with a helix in the N-terminal extension (sand). 
c, Interface between PPR domain and NTD. Colours are as in a. Interface 
residues are shown in stick representation. 
Figure 4 | Promoter binding and melting by mtRNAP. a, Differences in
topology of the initiation complexes of T7 and mammalian mitochondrial 
transcription machineries, based on foot-printing data and functional assays. 
b, Modelled location of upstream promoter DNA on mtRNAP, based on a 
superimposition with the T7 RNAP initiation complex (PDB 1QLN). mtRNAP 
is depicted as a molecular surface. Loops in mtRNAP that interact with DNA in 
the T7 system are indicated. The portion of the specificity loop not visible in the 
structure is modelled based on the corresponding T7 loop (dashed line). The 
AT-rich recognition loop is buried by the PPR domain (blue) and not seen. The 
intercalating hairpin (rose) is shifted by 7 Å compared with the T7 RNAP
initiation complex, and clashes with the template DNA strand (blue). A clash is 
also observed between the fingers subdomain and the +1 template base. The 
presumed binding site of TFB2M is indicated by a dashed oval. c, Relative


extensive interactions with the PPR domain (Fig. 3c). Consistent with 
this, a mtRNAP variant that lacks the PPR domain, and should contain
a free AT-rich recognition loop, cannot initiate transcription (Sup- 
plementary Fig. 1). In addition, the positively charged residues import- 
ant for promoter interactions by T7 RNAP are not conserved in the 
mtRNAP loop, and mutations R458A/R464A have no effect on tran- 
scription activity (Supplementary Fig. 2). These results indicate that 
human mtRNAP does not use its AT-rich recognition loop for pro- 
moter binding, explaining why it depends on transcription factors for 
initiation. The AT-rich recognition loop also has a different structure 
and function in RNAP of phage N4, where it recognizes a hairpin 
promoter” (Fig. 2d). 

mtRNAP shows two additional structural differences relative to T7 
RNAP that explain why it cannot melt DNA and requires TFB2M for
promoter melting. First, the position of the intercalating hairpin is not 
compatible with promoter melting as observed in T7 RNAP 
(Fig. 2b, c). The tip of the loop is translated 7 Å away from its position
in T7 and clashes with the template strand of DNA (Fig. 4b). Second, a 
25° rotation of the fingers domain towards the NTD (Fig. 1c) blocks 
access of the single-stranded DNA to the active site. 

Two mechanisms of promoter melting may be considered. First, 
binding of TFB2M may reposition the intercalating hairpin so that it 
can function as in T7 and N4 RNAP’. Second, DNA melting may be 
achieved not by the intercalating hairpin but by an unidentified struc- 
tural element in TFB2M. To investigate this we constructed variants of 
mtRNAP lacking either five residues at the tip of the intercalating 
hairpin (Δ613–617) or lacking a preceding insertion (Δ591–601).
orientation of the PBDs of mtRNAP (silver) and T7 RNAP initiation complex
(PDB 1QLN, salmon) based on homology modelling of the corresponding 
palm subdomain. The AT-rich recognition loop (orange) is shifted and rotated 
by 45°. d, An intercalating hairpin region is required for promoter melting in 
mtRNAP. Transcription run-off assays with wild-type (WT) and mutant 
RNAPs were performed with bubble (left panel) and double-stranded (right 
panel) light-strand promoter templates for 30 min at 35 °C. Transcription 
factors TFAM (50 nM) and TFB2M (150 nM) were added to reactions that
involved double-stranded template (right panel), whereas transcription with 
the bubble template was factor-independent (left panel). Reaction products 
were resolved with 20% PAGE containing 6 M urea, and analysed with a 
PhosphorImager. nt, nucleotides. 


These mutant polymerases had an activity similar to that of the 
wild-type enzyme on a pre-melted ‘bubble’ light-strand promoter template
(Fig. 4d) but were essentially inactive on a double-stranded light-strand
promoter template, suggesting a role of the intercalating hairpin
in DNA melting (Fig. 4d). To explore this further, we built a model of a
mitochondrial transcription initiation complex, which takes available 
structural, biochemical and genetic data’®**-”” into account (Sup- 
plementary Fig. 3). The model remains tentative but indicates that 
TFB2M may insert between the intercalating hairpin and thumb sub- 
domain, to reposition the intercalating hairpin and the fingers, such 
that they function in promoter melting (Supplementary Fig. 4). 
TFB2M may also reach near the active site to stabilize an early tran- 
scribing complex by binding the template strand and incoming NTP as 
suggested”®. 


METHODS SUMMARY 


Human mtRNAP, TFB2M and TFAM were expressed and purified as described"”. 
The coding sequences of human mtRNAP N-terminal deletion mutants Δ104
(residues 105-1230), Δ200 (201-1230) and Δ368 (369-1230) were amplified by
PCR and cloned into vector pProEx(Hb) (Invitrogen), to express N-terminally
hexahistidine-tagged proteins. Deletion mutants Δ613–617 and Δ591–601GG and
substitution mutant R458A/R464A were obtained by site-directed mutagenesis 
(Quick-change; Stratagene). Transcription run-off reactions were performed with 
synthetic DNA templates as described’? (Supplementary Fig. 1). For crystallization,
Δ104 mtRNAP was treated in situ with ArgC protease (1:1,000 w/w) for 1–2 h
at 23 °C. Initial crystals were obtained in hanging drops over 12% poly(ethylene
glycol) (PEG) 4000, 10% glycerol, 30 mM MES, 60 mM (NH4)2SO4, 40 mM Tris-HCl
pH 8.0, 20 mM dithiothreitol at 20 °C, and used for micro-seeding, which
produced rod-shaped crystals of up to 0.3 × 0.1 × 0.1 mm³ in size. Diffraction data
were collected in 0.5° increments at 100 K on a MarCCD detector at the Swiss Light
Source in Villigen, Switzerland (Supplementary Table 1). Data were integrated 
with MOSFLM”* and scaled with SCALA”. The structure was solved by molecular 
replacement using the T7 RNAP structure’ (PDB 1QLN) truncated to polyalanine 
and reduced to residues 246-354, 392-568, 691-741 and 766-883. The replace- 
ment solution was subjected to rigid-body refinement with Refmac-5 (ref. 30). The 
model was iteratively built with COOT and refined with autoBuster (Global 
Phasing) to a final free R-factor of 23%. The refined structure includes residues 
218-591, 603-611, 616-735, 770-1085 and 1109-1230. Residues 326-360 and the 
helix in the N-terminal extension were built as polyalanine because of poor side- 
chain density. Figures were prepared with PyMOL (Schrödinger).
Primary motor cortex underlies multi-joint
integration for fast feedback control 


J. Andrew Pruszynski, Isaac Kurtzer, Joseph Y. Nashed, Mohsen Omrani, Brenda Brouwer & Stephen H. Scott


A basic difficulty for the nervous system is integrating locally 
ambiguous sensory information to form accurate perceptions 
about the outside world’~*. This local-to-global problem is also 
fundamental to motor control of the arm, because complex mech- 
anical interactions between shoulder and elbow allow a particular 
amount of motion at one joint to arise from an infinite combination 
of shoulder and elbow torques’. Here we show, in humans and rhesus 
monkeys, that a transcortical pathway through primary motor cor- 
tex (M1) resolves this ambiguity during fast feedback control. We 
demonstrate that single M1 neurons of behaving monkeys can integ- 
rate shoulder and elbow motion information into motor commands 
that appropriately counter the underlying torque within about 50 
milliseconds of a mechanical perturbation. Moreover, we reveal a 
causal link between M1 processing and multi-joint integration in 
humans by showing that shoulder muscle responses occurring ~50 
milliseconds after pure elbow displacement can be potentiated by 
transcranial magnetic stimulation. Taken together, our results show 
that transcortical processing through M1 permits feedback res- 
ponses to express a level of sophistication that rivals voluntary con- 
trol; this provides neurophysiological support for influential 
theories positing that voluntary movement is generated by the 
intelligent manipulation of sensory feedback’. 

Extensive research has shown that some of our fastest motor re- 
actions express a degree of sophistication that rivals voluntary 
actions*”°, but little is known about the neural substrates that underlie 
this sophistication’'. The present monkey and human studies test the 
hypothesis that primary motor cortex (M1) provides a neural substrate 
for integrating shoulder and elbow motion information for fast feedback 
control, a key ability for generating fast and accurate corrections*'*”. 
M1 is a prime candidate to mediate this ability because: (1) it forms part
of a transcortical feedback pathway, giving it access to the required 
afferent information’*"*; (2) it is a key node for voluntary control, which 
appropriately incorporates shoulder and elbow information when 
generating commands for voluntary actions'*’’; and (3) influential 
theories posit that voluntary movement involves the sophisticated 
manipulation of sensory information’, suggesting substantial functional 
and anatomical overlap between voluntary and feedback control’. 

We first determined whether individual neurons in monkey M1 
exhibit a pattern of activity consistent with multi-joint motion integ- 
ration during fast feedback control. Two male rhesus monkeys were 
trained to counter unpredictable step-torque perturbations applied at 
the shoulder and/or elbow which displaced their hand from a central 
target. To receive water reward, the monkeys needed to return their 
hand to the target within 750 ms and remain within it for an additional 
3 s, allowing us to analyse both fast feedback responses (<100 ms post- 
perturbation) and steady-state motor outputs (last 1 s of stabilization) 
produced in response to the applied torque within the same trial. 

Because our experimental scheme (Fig. 1a) was specifically designed 
to examine sensorimotor control of the shoulder joint, we were 


principally interested in neurons whose steady-state motor outputs 
varied with the exerted shoulder torque (that is, shoulder-like neu- 
rons). As in our previous studies, we found that the population of 
neurons was biased towards combined shoulder and elbow torques 
(Rayleigh test for bimodality, P< 0.05), making shoulder-like neurons 
relatively rare’’. In total, 25 of 356 M1 neurons were categorized as 
shoulder-like because they exhibited significant directional tuning to 
steady-state loads (plane-fit, P< 0.05) and a preferred torque direction 
within 15° of either shoulder flexion or shoulder extension torque 
(Supplementary Fig. 1). 
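
To make the direction criterion concrete, the following Python sketch classifies a neuron from the two slope coefficients of a plane fit of firing rate against shoulder and elbow torque. It is an illustration under stated assumptions, not the authors' analysis: the function names are hypothetical, shoulder flexion and extension are placed at 0° and 180° of the torque plane by convention, and the significance test of the plane fit (P < 0.05) is assumed to have been applied separately.

```python
import math


def preferred_torque_direction(b_shoulder: float, b_elbow: float) -> float:
    """Preferred direction (degrees, 0-360) in shoulder/elbow torque space."""
    return math.degrees(math.atan2(b_elbow, b_shoulder)) % 360.0


def angular_distance(a_deg: float, b_deg: float) -> float:
    """Smallest absolute angle between two directions, in degrees."""
    d = abs(a_deg - b_deg) % 360.0
    return min(d, 360.0 - d)


def is_shoulder_like(b_shoulder: float, b_elbow: float,
                     tolerance_deg: float = 15.0) -> bool:
    """True if the preferred direction lies within tolerance of either
    shoulder axis (assumed at 0 degrees and 180 degrees)."""
    pd = preferred_torque_direction(b_shoulder, b_elbow)
    return min(angular_distance(pd, 0.0),
               angular_distance(pd, 180.0)) <= tolerance_deg
```

A neuron tuned equally to shoulder and elbow torque (preferred direction 45°) would fall outside the window and would not be counted as shoulder-like.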

The key question is how quickly shoulder-like neurons become 
selectively tuned to shoulder torque following an unexpected torque 
perturbation. This is not a mere restatement of our selection criteria 
Figure 1 | Experimental methods. a, Because of the mechanical properties of
the limb, an infinite combination of possible shoulder and elbow torque 
perturbations can cause the same resultant shoulder motion. Generating the 
shoulder motor response which counters the applied shoulder torque 
perturbation requires integrating elbow information. b, Limb configuration 
before (unfilled) and after (filled) a torque perturbation was applied at either the 
shoulder (τs) or elbow (τe) as in experiment 1. Opposite conditions (shoulder-extensor /
elbow-flexor torque) not shown. c, Joint displacement resulting from
the shoulder (red) and elbow (blue) perturbation conditions in b. The 
perturbations yielded similar shoulder motion but substantially different elbow 
motion. Solid lines represent the mean displacements and the grey lines show 
individual trials. d, Limb configuration before and after a multi-joint flexion or 
multi-joint extension torque perturbation as in experiment 2. e, The 
perturbations shown in d caused substantial elbow motion but almost no 
shoulder motion. 
because local shoulder information is sufficient for countering the
underlying shoulder torque in the steady-state. In contrast, the only 
way that fast feedback responses can account for the ambiguous rela- 
tionship between local joint motion and global torque is by integrating 
information from both the shoulder and elbow. 

The need to resolve ambiguous motion information for fast feed- 
back control is exemplified in our first experiment, where we applied 
either shoulder torque or elbow torque perturbations (Fig. 1b). These 
perturbations caused substantially different amounts of elbow motion 
but nearly identical shoulder motion (Fig. 1c). If shoulder-like neurons 
integrate both shoulder and elbow motion information, then they 
should differentiate between the two conditions and respond more 
strongly to the shoulder torque perturbation than the elbow torque 
perturbation. Figure 2a presents an exemplar neuron which follows 
this pattern. It was maximally active during steady-state compensation 
of shoulder-extension torque, and it responded more strongly to the 
shoulder-extension torque perturbation than the elbow-flexion torque 
perturbation (t-test, tg = 2.2, P<0.05) within ~60 ms of perturba- 
tion onset. Moreover, the population of shoulder-like neurons (Fig. 2b, 
Supplementary Fig. 2a) also quickly expressed greater activity for 
shoulder torque perturbations than elbow torque perturbations 
(paired t-test, t24 = 2.7, P < 0.01; 15 out of 20 neurons in monkey P
and 4 out of 5 neurons in monkey X show the expected trend). 

Our hypothesis makes the additional prediction that different 
amounts of inhibition should occur to torque perturbations that are 
opposite to a neuron’s steady-state preference. That is, a neuron 
which is maximally active during steady-state compensation of 
shoulder-extension torque should quickly express more inhibition to 
shoulder-flexion torque perturbations than to elbow-extension torque 
perturbations. This prediction was verified across the population 
(paired t-test, t24 = 2.1, P < 0.05), demonstrating that shoulder-like
neurons possess a pattern of multi-joint integration appropriate for 
both excitatory and inhibitory perturbations (Supplementary Fig. 2a). 
Figure 2 | Neurons in primary motor cortex. a, Responses of an exemplar
shoulder-like neuron to either an elbow (blue) or shoulder (red) torque 
perturbation in experiment 1 where shoulder or elbow torque perturbations 
yielded the same shoulder motion. Data aligned on perturbation onset. Dots 
represent single action potentials and the trace depicts the average response. 
b, Same format as a but representing the population response across shoulder- 
like neurons. The arrows indicate when neural activity is non-specific to the 
underlying shoulder torque perturbation and then when it begins to respond 
appropriately to the underlying shoulder torque. c, d, Same format as a and 
b but for experiment 2 where combined shoulder and elbow torque 
perturbations yielded no shoulder motion. 
Another situation where the nervous system must resolve locally
ambiguous information is exemplified in our second experiment, in 
which torque perturbations at both the shoulder and elbow cause 
substantial elbow motion but no shoulder motion (Fig. 1d,e). If fast 
feedback responses of shoulder-like neurons appropriately integrate 
shoulder and elbow motion to counter the underlying torque then they 
should respond to this perturbation, even though the shoulder joint is 
not displaced and no local shoulder sensor (in the muscle, joint or skin) 
can signal the event. Indeed, the exemplar neuron (Fig. 2c) increased 
its activity within ~60 ms of pure elbow extension motion, which is 
appropriate for countering the underlying shoulder extensor torque 
and consistent with its response in our first experiment (t-test,
tig = 5.1, P<10 *). The population of shoulder-like neurons also
quickly expressed the predicted response pattern (Fig. 2d, Supplementary
Fig. 2b; paired t-test, t24 = 4.4, P< 10°; 15 out of 20 neurons in
monkey P and 4 out of 5 neurons in monkey X show the expected 
trend). 

The above analysis established that M1 neurons integrate shoulder 
and elbow motion to counter the underlying torque perturbation 
within a binned epoch from 50 to 100 ms after perturbation onset 
(Fig. 3a). The response of shoulder-like neurons in this epoch 
paralleled the response of monkey shoulder muscles in the same epoch 
(Fig. 3b), suggesting that M1 contributes to the observed muscle activity. 
To provide further evidence of a functional link, we calculated the 
temporal evolution of multi-joint integration for both neurons and 
muscles using a receiver operating characteristic (ROC) analysis.
We found that multi-joint integration occurred in the population of
M1 neurons 8-20 ms before it occurred in muscles (Fig. 3c). Although
Figure 3 | Population analysis of neurons and muscles. a, Binned response
(50-100 ms post-perturbation) across shoulder-like neurons. For Experiment 1, 
the red and blue bars represent responses to shoulder and elbow torque 
perturbations, respectively. For Experiment 2, the red and blue bars depict 
responses to pure elbow motion caused by a torque perturbation aligned with or 
opposite to the neuron’s steady-state preference, respectively. Error bars, s.e.m.; 
*significant differences between conditions (paired t-test, P< 0.05); 
†significant differences from baseline; NS, not significant. b, Same format as
a but for the population of muscles. Because of the normalization procedure, 
muscle baseline activity is 0 arbitrary units (a.u.). c, Average discrimination 
probability over time for the population of neurons and muscles as determined 
by ROC analysis. Conditions are collapsed across experiments such that the 
vertical axis is a metric of multi-joint integration. On average, the neurons led 
the muscles by ~19 ms as estimated by the mean temporal difference between 
the neural and muscle ROC curves (ΔT) from 50 to 100 ms post-perturbation
(shaded area). d, Same format as Fig. 2d but for elbow-like neurons. 




this temporal lead is substantially shorter than typical measurements of 
M1-to-muscle latency at the initiation of voluntary movement 
(~60 ms), it is consistent with the known conduction delay between
M1 neurons and muscles of the monkey upper limb.
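
As an illustration of the ROC analysis described above, the following Python sketch computes, for each time bin, the probability that an ideal observer could tell two perturbation conditions apart from single-trial activity. The function names, the data layout (one list of binned activity per trial) and the rule that ties count as one half are assumptions made for illustration; they are not taken from the study's analysis code.

```python
def roc_area(cond_a, cond_b):
    """Area under the ROC curve: probability that a random value from
    cond_a exceeds a random value from cond_b (ties count as 0.5)."""
    wins = 0.0
    for a in cond_a:
        for b in cond_b:
            if a > b:
                wins += 1.0
            elif a == b:
                wins += 0.5
    return wins / (len(cond_a) * len(cond_b))


def discrimination_over_time(trials_a, trials_b):
    """trials_a, trials_b: lists of trials, each trial a list of binned
    activity values aligned on perturbation onset (hypothetical layout).
    Returns one ROC area per time bin."""
    n_bins = len(trials_a[0])
    return [
        roc_area([trial[i] for trial in trials_a],
                 [trial[i] for trial in trials_b])
        for i in range(n_bins)
    ]
```

A value of 0.5 means the two conditions cannot be distinguished in that bin, and values approaching 1 mean activity is reliably greater in the first condition. Comparing the resulting neural and muscle curves over time is one way to estimate a temporal lead.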

Interestingly, shoulder-like M1 neurons did not immediately 
account for the limb’s mechanical properties, displaying a non-specific 
response to the torque perturbations from ~20 to 50 ms following 
perturbation onset (Fig. 2b, d). A similar non-specific response was 
observed across our whole population of M1 neurons (Supplementary 
Fig. 2c,d). For example, neurons which preferentially responded to 
steady-state elbow torque (elbow-like neurons, Supplementary Fig. 1) 
initially exhibited the same response whether the perturbation flexed or 
extended the elbow joint as in our second experiment (Fig. 3d). This 
non-specific response is strikingly similar to a population of neurons in 
primary visual cortex (V1), which initially respond ambiguously to 
objects placed in their receptive field and become sensitive to motion 
direction only after 20-30 ms (ref. 2), a delay attributed to interactions 
among V1 neurons. The temporal evolution of multi-joint integration
that we observe may also reflect processing intrinsic to M1 or it may be 
caused by delayed contributions from other neural structures, such as 
somatosensory cortex and cerebellum; this is an important issue that 
warrants further investigation. 

Although the activity of single neurons in monkeys provides 
evidence that M1 is functionally linked to multi-joint integration for 
fast feedback control, the above data are ultimately correlational and 
cannot establish whether M1 causes the co-varying pattern of shoulder 
muscle activity. We addressed this issue by directly influencing the 
processing of M1 in human participants while they generated feedback 
corrections similar to those in the monkey study. Applying a single 
pulse of transcranial magnetic stimulation (TMS) over M1 will excite 
its intrinsic circuits and, thereby, evoke a burst of muscle activity. 
When TMS is applied in conjunction with a joint perturbation, the 
response in the stretched muscle is much larger than the linear sum of 
the response to TMS alone and the perturbation alone. These
supra-linear effects—previously demonstrated in the finger, wrist 
and elbow muscles—occur only when TMS is timed to evoke a res- 
ponse >50 ms after the perturbation, suggesting that the two stimuli 
interact through a common cortical circuit and that feedback control 
at latencies >50 ms reflects processing in M1. We established the 
validity of this technique for shoulder muscles, as a supra-linear res- 
ponse occurred when TMS was delivered ~65 ms after the shoulder 
muscle was stretched (that is, during the long-latency reflex; t-test; 
extensor: t9 = 6.7, P< 10°; flexor: t9 = 6.0, P< 10 °) but not
25 ms after the muscle was stretched (that is, during the short-latency
reflex; extensor: t9 = -0.5, P > 0.5; flexor: t9 = 0.5, P > 0.5), when
only spinal processes could contribute (Supplementary Fig. 3, left 
column). 

The critical question is whether M1 causally contributes to multi-joint 
integration for fast feedback control. We tested this hypothesis by apply- 
ing TMS in conjunction with the torque perturbation that causes pure 
elbow displacement (Fig. 1d, e). Any supra-linearity of the shoulder 
muscle response in this condition must reflect afferent information from 
the elbow joint onto cortical circuits controlling shoulder muscles, 
because local shoulder afferents are not physically affected by pure 
elbow motion. The predicted supra-linear effect was observed for both 
shoulder flexors and extensors (Fig. 4, Supplementary Fig. 3, right 
column) with TMS delivered 65 ms after perturbation onset (extensor: 
t9 = 3.8, P < 0.01; flexor: t9 = 5.3, P<10 °) but not 25 ms after
perturbation onset (extensor: t9 = -2.7, P > 0.5; flexor: t9 = -0.1,
P>0.5). The observed supra-linearity probably reflects latency- 
specific engagement of M1 rather than a general change in motor 
neuron excitability, because we found no correlation between the mag- 
nitude of perturbation-evoked activity and the amount of supra- 
linearity at either latency (P>0.1, Supplementary Fig. 4). Taken 
together, these results provide strong evidence that M1 causally under- 
lies multi-joint integration for fast feedback control. 
Figure 4 | TMS and perturbation evoked activity in human shoulder
muscles. a, Response evoked in an exemplar shoulder muscle (posterior 
deltoid) when a mechanical perturbation or TMS was applied in isolation. Note 
that the mechanical perturbation yielded a robust long-latency reflex and that 
the TMS yielded substantial evoked activity. b, Observed response (orange) and 
linear prediction (sum of responses in a, black) when the mechanical 
perturbation and TMS were applied in the same trial. c, Group muscle response 
(mean ± s.d.) when TMS was paired with the perturbation normalized by the
sum of their separate effects (E_norm = E_TMS,pert / (E_TMS + E_pert)). Values above 1
indicate supra-linearity; *significant supra-linearity (t-test, P< 0.05). 
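
The normalization in panel c can be written as a short Python sketch. The function name and the example numbers are hypothetical; only the ratio itself, the combined response divided by the sum of the two separate responses, comes from the caption.

```python
def normalized_response(e_tms_pert, e_tms, e_pert):
    """Combined TMS + perturbation response divided by the linear
    prediction (sum of the responses to each stimulus alone).
    Values above 1 indicate a supra-linear interaction."""
    return e_tms_pert / (e_tms + e_pert)


# Hypothetical evoked activities in arbitrary units.
ratio = normalized_response(e_tms_pert=3.0, e_tms=1.0, e_pert=1.0)
is_supra_linear = ratio > 1.0
```

With these made-up values the combined response is 1.5 times the linear prediction, which would count as supra-linear.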


Previous studies have demonstrated that fast feedback responses in
M1 are scaled by task constraints, such as movement amplitude,
surface texture and intended vigour. Our results show that M1 also
integrates locally ambiguous motion information into a global response
that accounts for the limb’s mechanical properties, a more complex
capability that is central to successfully guiding whole-arm movements.
It is well established that the voluntary motor system accounts
for the mechanical properties of the limb and that this capability is
expressed in the activity of M1 neurons. We have previously argued
that the functional similarity of voluntary and feedback control is not
an accident and probably arises because of a common neural implementation
that includes M1. This expectation is consistent with recent
theories of sensorimotor control, which posit that voluntary behaviour
involves the sophisticated manipulation of sensory information. If our
suggestion is true, then feedback processing in M1 should possess all 
the capabilities of voluntary processing in M1 and, likewise, studying 
feedback processing may provide a useful window into voluntary 
control. 


METHODS SUMMARY 


Studies were approved by the Queen’s University Research Ethics Board and 
Animal Care Committee. Ten human subjects gave informed consent and completed
the experiments as previously described. Two monkeys (Macaca mulatta,
~10 kg) performed a similar experimental scheme in a miniaturized version of the
same apparatus with ~10× smaller loads (KINARM, BKIN Technologies).
Monkeys did not counter a pre-perturbation background load, and were exposed 
to eight randomly-interleaved step-torque perturbations, four used in experiment 1 
and two used in experiment 2. All eight conditions were used to calculate steady- 
state tuning by performing a planar regression on the neural activity when the 
monkey had re-stabilized its hand at the central target.

Neural recordings were performed with single electrodes and processed according
to standard techniques. Monkey muscle activity was acquired from mono-articular
shoulder muscles (anterior/middle/posterior deltoid, pectoralis major;
n = 34) using fine-wire electrodes. Human experiments used surface electrodes
(posterior deltoid, pectoralis major; n = 19). Population responses for both neurons 
and muscles were collapsed across shoulder flexion and extension conditions 
according to their predicted excitatory and inhibitory effects (for example, shoulder- 
flexion torque perturbations were excitatory for shoulder extensor muscles/neurons 
and inhibitory for shoulder flexor muscles/neurons). 

Single pulses of TMS (MES-10, Cadwell) were applied over left M1 with a 
posterior orientation of 30-45°. Placement/orientation of the double coil was 
chosen to evoke the largest response from the muscle of interest, ~4.5 cm lateral 
from vertex. Stimulation magnitude (40-50% of maximum) was selected to deliver 
the smallest-possible consistent response (evoked activity on seven consecutive 
trials) when the shoulder muscle countered a steady-state load. TMS only, per- 
turbation only and combined TMS and perturbation trials were randomly inter- 
leaved. TMS was timed to evoke shoulder muscle activity either ~25 ms or ~65 ms 
after perturbation onset. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Participants and apparatus. The studies presented were approved by the Queen’s 
University Research Ethics Board. All monkey (n = 2, Macaca mulatta, ~10 kg,
male) procedures were approved by the Queen’s University Animal Care 
Committee. Human subjects (6 females, 4 males, median age = 27) were neuro- 
logically unimpaired, had normal/corrected vision and provided informed con- 
sent. Human and monkey experiments were performed using different versions of 
the same robotic exoskeleton (KINARM, BKIN Technologies) which allows com- 
bined flexion and extension movements of the shoulder and elbow in the hori- 
zontal plane and can independently apply mechanical loads to the shoulder and/or 
elbow. Target lights and simulated hand feedback were presented in the horizontal
plane of the task via a virtual reality display, and direct vision of the hand
was limited either by a physical barrier (humans) or by a lack of ambient light 
(monkeys). 

Neural, muscle and kinematic recordings. Recording chambers were surgically 
implanted under inhalation anaesthetic, and neural recordings were performed 
according to standard techniques. Single tungsten microelectrodes (FHC) were
advanced until neural activity was observed. Individual neurons were then isolated 
and neural activity was recorded from those neurons with clear responses to either 
passive or active movements of the shoulder and/or elbow. Neurons which primarily 
responded to motion of the wrist or fingers were not recorded. Neurons recorded in 
the task (n = 356) were located in the rostral bank of the central sulcus as well as 
more superficial sites where previous mapping efforts showed that trains of electrical 
stimulation (11 pulses, 333 Hz, 0.2 ms pulse width, <50 µA) could elicit shoulder
and/or elbow movement. Post-mortem histology confirmed that recording sites 
from monkey P were located in M1. 

Monkey muscle activity was acquired from mono-articular shoulder muscles 
(anterior deltoid, middle deltoid, posterior deltoid, pectoralis major; n = 34) using 
fine-wire electrodes. Electrodes consisted of two single-strand wires and were 
individually inserted into the muscle belly spaced ~5 mm apart. Insertion was 
guided by anatomical landmarks and was confirmed by microstimulation. Human 
experiments used surface electrodes (Bortec AMT-8) and focused on those mono- 
articular muscles which could be readily recorded from the surface (posterior 
deltoid, pectoralis major; n = 19). Muscle activity was recorded at either 4 kHz 
(monkey) or 1 kHz (human), aligned on perturbation onset and full-wave rectified 
before analysis. Only those muscles with clear phasic responses to the mechanical
perturbation were analysed. Kinematic data and applied torques were acquired 
directly from the KINARM device and were sampled at the same rate as the muscle 
activity. 

Transcranial magnetic stimulation. We followed standard procedures in the 
TMS portion of the study. Single pulses of TMS (MES-10, Cadwell) were
applied over left M1 with a posterior orientation of 30-45°. Placement and orienta- 
tion of the double cone coil were chosen to evoke the largest response from the 
muscle of interest, ~4.5 cm lateral from vertex. Stimulation magnitude was selected 
to deliver the smallest-possible consistent response (evoked response on seven 
consecutive stimulations, average of 40% and 51% of the stimulator’s maximum 
output for the posterior deltoid and pectoralis major, respectively) when the muscle
of interest actively countered a 3 N m background load (that is, active motor threshold).

Experimental scheme. The experimental procedure and logic have been previously
described. The major difference in the human portion of this study was the
parallel implementation of TMS. Briefly, subjects stabilized their hand in a small 
central target while countering a steady-state shoulder torque (3 N m) which
activated either the shoulder flexor or extensor muscles. After a random hold time
(1-4 s), an unpredictable torque pulse (100 ms duration) was introduced and the
trial ended when the subjects re-stabilized in the target for 500 ms. 

In total, four torque perturbations were used in the human study. Two single- 
joint torque perturbations (3 N m shoulder-flexion for shoulder flexor muscles
and 3 N m shoulder-extension for shoulder extensor muscles) made up experiment 1,
and two multi-joint torque perturbations (3 N m shoulder-flexion/3 N m
elbow-flexion for shoulder flexor muscles; 3 N m shoulder-extension/3 N m
elbow-extension for shoulder extensor muscles) made up experiment 2.
Perturbation only, TMS only and combined TMS and perturbation trials were 
randomly interleaved. In combined TMS and perturbation trials, the TMS was 
timed to evoke shoulder muscle activity either ~25 ms or ~65 ms after perturba- 
tion onset. 

The perturbation, TMS placement and TMS intensity were chosen for the 
shoulder flexor muscle and shoulder extensor muscle in two successive blocks. 
Half the subjects began with the conditions for the shoulder flexor muscle and half 
the subjects began with the conditions for the shoulder extensor muscle. Thirty 
repeats of the 14 conditions were collected for a total of 420 trials in a session that 
lasted about 2.5 h. 

Monkeys performed a similar scheme with ~10× smaller loads. Unlike
humans, the monkeys did not counter a pre-perturbation background load and
they were exposed to eight randomly-interleaved step-torque perturbations
([shoulder torque, elbow torque], applied flexion/extension = positive/negative:
1. [0.28 N m, 0 N m], 2. [0.24, 0.24], 3. [0, 0.24], 4. [-0.2, 0.2], 5. [-0.28, 0], 6.
[-0.24, -0.24], 7. [0, -0.24], 8. [0.2, -0.2]) and catch trials where no perturbations
occurred. Four of these perturbations (1,3,5,7) formed experiment 1 and two
(2,6) formed experiment 2. All eight conditions were used to calculate the steady-
state tuning of each neuron by performing a planar regression on the neural
activity when the monkey had re-stabilized its hand at the central target. To
receive water reward, the monkeys needed to return their hand to the target within 
750 ms and remain within it for an additional 3 s, allowing us to analyse both fast 
feedback responses (<100 ms post-perturbation) and steady-state motor outputs 
(last 1 s of stabilization) to the applied torque within the same trial. Five to twenty 
repeats were collected per experimental condition. 
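
The planar regression used for steady-state tuning can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the model form (firing rate = b0 + bs × shoulder torque + be × elbow torque), the function names and the use of the angle atan2(be, bs) as a preferred torque direction are choices made here for clarity, and the firing rates are invented. Only the eight load conditions come from the text above.

```python
import math

# The eight [shoulder, elbow] torque conditions in N m (flexion positive).
LOADS = [(0.28, 0.0), (0.24, 0.24), (0.0, 0.24), (-0.2, 0.2),
         (-0.28, 0.0), (-0.24, -0.24), (0.0, -0.24), (0.2, -0.2)]


def solve3(a, b):
    """Solve a 3x3 linear system by Gauss-Jordan elimination
    with partial pivoting."""
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(3):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][3] / m[i][i] for i in range(3)]


def fit_plane(loads, rates):
    """Least-squares fit of rate = b0 + bs*tau_s + be*tau_e
    via the normal equations. Returns [b0, bs, be]."""
    rows = [(1.0, ts, te) for ts, te in loads]
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(3)]
           for i in range(3)]
    atb = [sum(r[i] * y for r, y in zip(rows, rates)) for i in range(3)]
    return solve3(ata, atb)


def preferred_torque_direction(bs, be):
    """Angle of the fitted plane's gradient in torque space, in degrees
    (0 = shoulder flexion, 90 = elbow flexion in this convention)."""
    return math.degrees(math.atan2(be, bs))
```

In this convention, a neuron whose fitted gradient points mostly along the shoulder axis would be a candidate shoulder-like neuron; the exact classification criterion is not given in this section, so none is assumed here.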

Population responses for both muscles and neurons were calculated by collaps- 
ing across shoulder flexion and extension conditions according to their predicted 
excitatory and inhibitory effects. That is, applied shoulder-flexion torque pertur- 
bations were excitatory for shoulder extensor muscles/neurons and inhibitory for 
shoulder flexor muscles/neurons. Applied shoulder-extension torque perturba- 
tions were excitatory for shoulder flexor muscles/neurons and inhibitory for 
shoulder extensor muscles/neurons. 
Theta-paced flickering between place-cell maps in the hippocampus


Karel Jezek, Espen J. Henriksen, Alessandro Treves, Edvard I. Moser & May-Britt Moser


The ability to recall discrete memories is thought to depend on the 
formation of attractor states in recurrent neural networks. In
such networks, representations can be reactivated reliably from 
subsets of the cues that were present when the memory was encoded, 
at the same time as interference from competing representations is 
minimized. Theoretical studies have pointed to the recurrent CA3 
system of the hippocampus as a possible attractor network.
Consistent with predictions from these studies, experiments have 
shown that place representations in CA3 and downstream CA1 
tolerate small changes in the configuration of the environment 
but switch to uncorrelated representations when dissimilarities 
become larger. However, the kinetics supporting such network
transitions, at the subsecond timescale, are poorly understood. Here
we show in rats that instantaneous transformation of the spatial 
context does not change the hippocampal representation all at once 
but is followed by temporary bistability in the discharge activity of 
CA3 ensembles. Rather than sliding through a continuum of inter- 
mediate activity states, the CA3 network undergoes a short period of 
competitive flickering between preformed representations of the 
past and present environment before settling on the latter. 
Network flickers are extremely fast, often with complete replace- 
ment of the active ensemble from one theta cycle to the next. Within 
individual cycles, segregation is stronger towards the end, when 
firing starts to decline, pointing to the theta cycle as a temporal unit 
for expression of attractor states in the hippocampus. Repetition of 
pattern-completion processes across successive theta cycles may 
facilitate error correction and enhance discriminative power in 
the presence of weak and ambiguous input cues. 

The place-cell population of the hippocampus is thought to create a 
neural representation of the spatial environment. Accumulating
evidence indicates that environments are generally represented in 
hippocampal cells by many discrete maps, each corresponding to a 
distinct environment or a unique experience within the environment.
Which map is active at any given time depends on external
sensory inputs as well as recent history. Incongruity between the
active map and sensory inputs may lead to partial or complete replacement
of the active representation. The kinetics of map substitutions
have remained elusive owing to a shortage of experimental and analytical
tools for subsecond-timescale neural population analyses. 

Here, we developed tools to determine how local network activity 
evolves in the hippocampus in response to sudden changes in the cues 
that define spatial context. Rats with tetrodes in CA3 (Supplementary 
Fig. 1) were first trained on separate trials in two boxes with different 
sets of light cues in a dark room (boxes A and B). The procedure 
favours the development of uncorrelated place representations in A 
and B (Supplementary Fig. 2). After several days of training in each
box, the rats started in one of the environments; then, after 40-60 s, the 
cues were switched instantaneously to those of the other environment, 
effectively ‘teleporting’ the rat from A to B or vice versa. A total of 169 
such trials, hereafter referred to as teleportation trials, were performed, 


with an average of 33 ± 3 active cells per day (mean ± s.e.m.; total of
358 active cells; 11 days; 6 rats). 

To examine the evolution of network activity after the cue change, 
we first established expected firing patterns for all locations in each 
environment. Firing rates were determined for each cell in each spatial 
bin of each box on separate reference trials (30 × 30 bins; Fig. 1a). As
expected for CA3 place cells, the subsets of active cells in the two boxes
overlapped minimally: cells with activity in both A and B generally fired
at unrelated positions (spatial correlation: 0.112 ± 0.019). The nearly
orthogonal nature of the baseline representations made it possible, in 
principle, to infer from any subsequent cell sample which of the two 
environments was represented in the hippocampal network at the time 
of recording. Thus, in the next step, we compared the evolution of 
activity over successive theta cycles in the teleportation test with activity 
at corresponding locations in the reference environments. The theta 
phase with the lowest overall firing rate was used to segment the record- 
ing from each teleportation trial into individual theta cycles (Fig. 1b, c 
and Supplementary Fig. 3). A population vector was then constructed 
for each theta cycle, consisting of the number of spikes of each of the C 
simultaneously recorded cells (Fig. 1c). For each theta cycle, the popu- 
lation vector was correlated with vectors of average firing rates (in Hz) 
for the same location in each reference recording. We first used a 
simple linear measure of correlation, the un-normalized dot product 
(DP), between the test population vector r and each of the two 
reference vectors, r_A and r_B. If r_A and r_B were strictly orthogonal and,
after teleportation, test vectors were to be linear combinations
r ≈ c_A r_A + c_B r_B, the DP measure would allow a direct estimate of
the coefficients c_A(t) and c_B(t), that is, of the strength with which
the two representations are relayed to downstream neurons at time t.
Only trials with continuous theta activity were analysed (149 out of 169 
teleportation trials). 
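
The dot-product measure can be illustrated with a short Python sketch. The function names and the toy numbers are hypothetical; the only elements taken from the text are the population vector of spike counts per theta cycle, the reference vectors of average firing rates at the same location, and the division of the dot product by the number C of recorded cells.

```python
def dot(u, v):
    """Un-normalized dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def cycle_overlap(spike_counts, ref_a, ref_b):
    """Overlap of one theta cycle's population vector with the two
    reference vectors, each divided by the number of cells C.

    spike_counts: spikes per cell in this theta cycle
    ref_a, ref_b: average firing rates (Hz) of the same cells at the
                  rat's current position in reference boxes A and B
    """
    c = len(spike_counts)
    return dot(spike_counts, ref_a) / c, dot(spike_counts, ref_b) / c


# Toy example with C = 4 cells: two active only in A, two only in B.
overlap_a, overlap_b = cycle_overlap(
    [2, 0, 1, 0], [4.0, 0.0, 2.0, 0.0], [0.0, 3.0, 0.0, 5.0])
```

Because the two toy reference vectors are orthogonal, a cycle in which only the A cells fire yields a positive overlap with A and zero overlap with B, which is the situation that makes the coefficients directly readable.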

The switching of spatial cues was followed by a characteristic pattern 
of network activity (Fig. 2). Before the transformation, most theta 
cycles correlated strongly with reference vectors for the first envir- 
onment (denoted as I, whether A or B) but not for the second (II), 
as expected (Figs 1d and 2a, b). After cue switching, most theta cycles 
correlated with the reference vectors for II but the magnitude of the dot 
product was variable and the network occasionally ‘flickered’ back to 
strong correlations with I. To investigate the statistics of this flickering, 
we plotted for each theta cycle the correlation of the momentary popu- 
lation vector with the reference vectors from the same environment 
(x-axis) and the alternative environment (y-axis) (Fig. 2c). The analysis 
confirmed that cue switching increased the proportion of cycles cor- 
related with the alternative environment but also identified a subset of 
cycles that correlated modestly with both environments at the same 
time. This led us to ask if the instantaneous activity during the transi- 
tion reflected a simple linear combination of fragments of the A and B 
representations, as would be expected in the absence of population 
coherence. We consequently determined whether, in the pooled data, 
the proportion of theta cycles correlating with both environments at 
Figure 1 | Procedures for analysing hippocampal transition dynamics.

a, Stack of firing-rate maps in box A (white floor lights; left) and box B (green wall 
lights; right) for an example set of 32 simultaneously recorded hippocampal CA3 
units. Each map shows a colour-coded distribution of firing rates across the 
square test box (blue, silent; red, maximum). Red line, one of 30 × 30 population
vectors (PV) constructed from the activity of the entire cell ensemble in a given
2 × 2 cm position bin. Note strong difference in population vectors for A and B.
b, Theta phase modulation for all pyramidal cells on a representative trial. Spike
number is shown as a function of theta phase (bin size 10 deg). Dashed green line,
phase with lowest firing rate, used to define boundary between cycles. 

c, Representative spike distribution across theta cycles in the stable state. Rasters 
of red dots show spike times of individual cells in relation to 6-11 Hz filtered local 
electroencephalogram (EEG; blue). Green lines indicate theta-cycle boundaries 
(b). The ensemble distribution of activity during one cycle represents the 
momentary population vector. d, Dot-product correlation between momentary 
population vector and reference vectors at the corresponding position in A (red) 
and B (blue) during a baseline trial in A. e, Cumulative product between 
correlations with each of the reference environments as a function of the phase for 
segmentation of theta cycles (0, phase of minimum activity). 


the same time (r·r_A > C, r·r_B > C, referred to as ‘mixed theta cycles’)
was lower than expected if each single unit expressed either one or the 
other representation independently of the other units. A total of 1.25% 
of the theta cycles in the recorded data were mixed. This number was 
lower than in 970 out of 1,000 randomly recombined (shuffled) popu- 
lation vectors (that is, P< 0.03; Fig. 2d). The separation between the 
A- and B-correlated representations was strongest when the cycles were 
chunked at the point of the lowest average firing rate in the population 
(Fig. 1b, c, e and Supplementary Fig. 3). Thus, mixed representations 
existed but were rare. Transitions between orthogonal maps tend to 
occur in an all-or-none manner, with the entire network flickering 
coherently at time scales of approximately a tenth of a second. 
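
A shuffle test of this kind can be sketched in Python as below. The recombination procedure is an assumption: here each cell's spike counts are permuted across theta cycles independently of the other cells, which destroys population coherence while preserving each cell's own activity. The function names, the seed and the data layout are hypothetical, and the sketch ignores the fact that reference vectors depend on the rat's position in each cycle.

```python
import random


def count_mixed(cycles, refs_a, refs_b):
    """Count theta cycles whose dot products with both reference
    vectors exceed C, the number of recorded cells."""
    n_mixed = 0
    for r, ra, rb in zip(cycles, refs_a, refs_b):
        c = len(r)
        dp_a = sum(x * y for x, y in zip(r, ra))
        dp_b = sum(x * y for x, y in zip(r, rb))
        if dp_a > c and dp_b > c:
            n_mixed += 1
    return n_mixed


def shuffle_cells(cycles, rng):
    """Permute each cell's spike counts across cycles independently."""
    n_cells = len(cycles[0])
    columns = []
    for j in range(n_cells):
        column = [cycle[j] for cycle in cycles]
        rng.shuffle(column)
        columns.append(column)
    return [[columns[j][k] for j in range(n_cells)]
            for k in range(len(cycles))]


def fraction_shuffles_with_more_mixed(cycles, refs_a, refs_b,
                                      n_shuffles=1000, seed=0):
    """Fraction of shuffled data sets containing more mixed cycles
    than the observed data."""
    rng = random.Random(seed)
    observed = count_mixed(cycles, refs_a, refs_b)
    more = sum(
        count_mixed(shuffle_cells(cycles, rng), refs_a, refs_b) > observed
        for _ in range(n_shuffles)
    )
    return more / n_shuffles
```

If nearly all shuffled data sets contain more mixed cycles than the recording, the observed separation between the two representations is unlikely to arise from cells switching independently.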

We subsequently examined the evolution of network activity within 
the theta cycle. Each cycle was divided into two halves, and mixed 
states were defined as those half-cycles for which r·r_A > C/2,
r·r_B > C/2. Mixed population vectors were less abundant than
Figure 2 | Theta cycles correlate with either of the reference environments
but rarely with both simultaneously. a, Top, local hippocampal EEG during 
spatial cue switching from A to B (filtered at 6-11 Hz). Bottom, dot-product 
correlation between momentary population vectors and reference vectors from A 
(red) and B (blue) for successive theta cycles before and after cue switching. Dot 
products are un-normalized (just divided by the number C of recorded cells). All 
correlations are positive but, for clarity, A and B correlations are plotted in 
opposite directions. Green line indicates light switch. EEG and ensemble activity 
were sampled simultaneously. Note that ensemble activity flickered back to the A 
representation several times after cue switching. Note also the variation in the dot 
product. b, Another example of network flickering induced by switching of 
spatial cues. c, Matrices reporting the number of cycles falling in each 0.2 × 0.2
bin of the dot-product correlations r·r_I,II/C between momentary population
vectors and reference vectors for the present environment (x-axis) or the
alternative environment (y-axis). Left, before cue switching. Right, after cue
switching (starting from the first cycle correlated with new environment). Note
that mixed cycles, defined as cycles with both r·r_I and r·r_II exceeding C (x > 1,
y > 1; indicated by red lines), were rare. d, Histograms showing that the number 
of mixed states after cue switching (red line) is lower than expected from shuffled 
versions of the same data (n = 1,000; grey histogram). Note that mixed states 
became less frequent during the second half of the theta cycle. 


expected from shuffled data during both half-cycles; however, in the 
shuffled data, the frequency of recombinations with more mixed popu- 
lation vectors than in the observed data increased from 958/1,000 
during the first half (P < 0.05) to 1,000/1,000 during the second
(P < 0.001) (Fig. 2d). The low incidence of mixed population vectors 
at the end of the theta cycle suggests that representations evolve from 
partially segregated to fully segregated within each activity period. 
We then asked how A-correlated and B-correlated theta cycles were 
organized in time. Because dot products can vary from 0 to indefinitely 
large, we switched to Pearson product-moment correlations, which 
by normalizing the correlations to within a fixed [−1, +1] range
allow successive theta cycles to be compared more directly (Fig. 3a–c
and Supplementary Figs 4-6). Individual theta cycles were now only 
considered if at least two cells were active (for higher thresholds, see 
Supplementary Fig. 7). As observed with the dot products, the 
Figure 3 | Temporal dynamics of network flickering. a–c, Pearson product-moment
correlations showing evolution of population vector correlations after
spatial cue switching from box I to II (red, correlation with I; blue, with II; green 
line, cue switch). Note frequent flickers to the original representation after cue 
switching in a and b. c, Spontaneous flickering between cue-switch events. 
Frames: sequences detailed in d and Supplementary Fig. 8. d, Spatial 
distribution of correlations between momentary population vectors and 
reference vectors in the framed area in a. Each row shows correlation matrices 


momentary population vectors correlated strongly with either A or B 
but rarely with both. Before cue switching, theta cycles were nearly 
exclusively correlated with reference vectors for the initial environ- 
ment (I). After cue switching, the network switched almost instanta- 
neously to high correlation with reference vectors for II but then 
relapsed to I several times during the subsequent seconds (Fig. 3a, b 
and Supplementary Figs 5 and 6) and occasionally tens of seconds after 
cue switching (Fig. 3c). These relapses, or flickers, were confined to 
discrete periods of one or several theta cycles. When the population 
vector was correlated with reference vectors from other locations in the 
represented environment, the correlation generally decreased with 
distance from the animal (Fig. 3d and Supplementary Fig. 8). 

To quantify the frequency, timing and duration of flicker events, we 
defined individual theta cycles as A-correlated if the correlation with 
reference environment A was above the 95th percentile for B × A correlations
in the reference sessions (that is, more similar to A than 95% of
theta cycles in B) and if the correlation with reference environment B was
simultaneously below the 5th percentile for B × B (that is, different from
most theta cycles in B) (Supplementary Figs 9 and 10). The analysis 
showed a clear increase in the frequency of network flickers during the 
first seconds following the cue switch from I to II, after the network had 
switched to the II representation for the first time (generally 0-1 s after 
the cue change; Supplementary Fig. 11 and Supplementary Table 1). The 
fraction of theta cycles participating in flicker episodes, estimated with 
Pearson correlations, increased from a stable baseline of 1-3% before cue 
switching to a level of 10-15% during the first 5 s after the first network 
switch (Fig. 3e and Supplementary Fig. 12), confirming the tendency 
revealed by the dot-product analyses (Fig. 2c). The interquartile range of 
flicker durations increased from 1-1 theta cycles before cue switching to 
1-4 during the first 10 s after cue switching (Wilcoxon rank-sum test:
for 16 consecutive theta cycles. Top, correlation with I; bottom, with II.
Correlation is colour coded (scale bar). + indicates rat position. e, Percentage of 
flickers to the alternative representation as a function of time before and after 
cue switching. f, Distribution plots showing the time for the network to switch 
from present (‘correct’) to past (‘incorrect’) representation, or vice versa, in 
number of theta cycles. One cycle means that cycles with alternative 
representations were consecutive. Note the predominance of immediate 
transitions. 


Z = 2.27, P < 0.03; Supplementary Fig. 11b and Supplementary Table
1). Flicker events were distributed across the entire recording box 
(Supplementary Fig. 13) and showed no preference for running or head- 
ing direction (Supplementary Fig. 14). Flickering was apparent also in 
CA1 but discrete relapses were clearly less frequent, possibly due to the 
less reverberating architecture of this subfield (Supplementary Fig. 15). 

Transitions between representations occurred within less than a 
single theta cycle. After the last cycle in a series of non-flicker theta 
cycles, the alternative representation was fully expressed already in the 
subsequent theta period in 30.8% of the flicker events where flicker 
duration could be determined (Fig. 3f). The corresponding percentage 
of single-cycle returns to the ‘correct’ representation was 32.0%. In 
57.0% of transitions from ‘correct’ to ‘incorrect’ reference frame and 
62.2% of transitions back from ‘incorrect’ to ‘correct’, the time course 
remained undetermined because the intervening theta cycle contained 
fewer than two spikes. In each instance of an immediate transition, the 
network representation was fully developed from the outset, that is, the 
correlation with the new environment did not increase further within 
the flicker period (r = −0.06 ± 0.11).

To determine further if the flicker episodes were patterned by the 
theta oscillation, we finally compared the transition dynamics of our 
theta-based segmentation procedure with segmentations based on fixed 
time bins of different width (range 44-500 ms; Supplementary Fig. 3). 
Among the fixed bins, direct transitions were most abundant when the 
bins matched the average duration of theta cycles (125 versus 120.4 ms, 
respectively). The abundance of sharp transitions increased further 
when the trial was segmented by actual rather than fixed theta periods, 
with cycles split at the phase with the minimum firing rate (Fig. 1b, c, e 
and Supplementary Fig. 3). Collectively, these observations suggest 
strongly that the transitions were paced by the theta rhythm. 
Our study provides evidence for competitive interactions between
hippocampal representations during changes in spatial reference 
frame. Although a small subset of the population vectors correlated 
with both reference environments after the cue change, the number of 
such mixed states was lower than expected from a sample of independ- 
ent single units, especially during the second half of each theta cycle. In 
most cases, the network either switched all at once or flickered between 
mutually exclusive representations until, after a few seconds, it settled 
in one of the alternatives. The sustained separation of the neural activ- 
ity pattern is consistent with the notion that spatial environments, as a 
whole, are stored as discrete attractors in neural networks of the hippocampus
or associated areas such as the entorhinal cortex. These
discontinuities differ from the continuous or quasi-continuous nature
of spatial maps for individual environments, where sweeps can
follow unbroken trajectories, even when retrieval occurs in the absence
of actual movement. Changes in attention or experience are likely to
generate continuous transitions of the latter type more or less constantly
in all environments. The low frequency of flickering in the
baseline state of the teleportation task suggests, however, that switches 
to uncorrelated attractor maps are rare and occur primarily when cues 
are ambiguous or in conflict across sensory modalities. 

The time course of flickering episodes has implications for the 
mechanisms of ensemble activation. It often took only a single theta
cycle to fully reactivate a pre-established representation during a flicker 
event, consistent with models of theta phase precession in which 
ensembles are activated by propagation through recurrent collaterals 
after afferent input to a subset of cells early in the theta cycle. The idea
that sensory influences can override attractors at the beginning of the 
theta cycle, whereas subsequent activity is determined more exclusively 
by propagation through associative connections, receives further sup- 
port from the fact that the small number of mixed population vectors 
occurred primarily during the first half of the theta cycle. The recreation 
of spatial representations on successive theta cycles in the hippocampus 
is fundamentally different from the pattern-completion dynamics 
observed, for example, in the inferior temporal cortex, where the activa- 
tion may proceed only once per stimulus presentation, at a timescale 
short enough to facilitate perception but with little opportunity for 
error correction. In the hippocampus, repeated convergence to an
attractor state might allow the system to self-correct and thereby 
enhance its discriminative power under conditions where input cues 
are weak and ambiguous. The present data point to theta cycles as 
organizational units for this repetitive process but do not preclude 
additional structure at faster time scales, for example with gamma 
cycles as units for cell assembly sequences within the theta cycle.


METHODS SUMMARY 


Neuronal ensemble activity was recorded from ensembles of CA3 cells while rats 
foraged in either of two distinct enclosures in a dark room. The enclosures were 
identical except for internal lights. Testing began by placing the rat in one of the two 
boxes. After 40-60 s, the light cues were switched, effectively ‘teleporting’ the animal 
to the other environment. To examine the evolution of network activity, population 
vectors were defined from the firing rates of all simultaneously recorded cells for 
every theta cycle before and after the switching of spatial cues. Each population 
vector was correlated with reference vectors defined from the activity of the same 
cells at the same spatial location in each box on separate baseline trials. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Subjects. Six male Long Evans rats (400-500 g at implantation) were housed 
individually in transparent Plexiglass cages (45 cm × 30 cm × 35 cm). The animals
were kept at ~90% of their initial free-feeding body weight and maintained on a 
12-h light/12-h dark schedule. All testing occurred in the dark phase. The experi- 
ments were performed in accordance with the Norwegian Animal Welfare Act and 
the European Convention for the Protection of Vertebrate Animals used for 
Experimental and Other Scientific Purposes. 

Electrode preparation and surgery. Neuronal ensemble activity was recorded 
from ensembles of CA3 or CA1 cells in rats implanted with a ‘hyperdrive’ containing
14 independently movable tetrodes assembled in a circular bundle. Tetrodes
were twisted from four 17-µm polyimide-coated platinum-iridium wires (90% and
10%, respectively; California Fine Wire Company). Electrode tips were plated with
platinum to reduce electrode impedances to 120-200 kΩ at 1 kHz.

The animals were food deprived 12 h before surgery started. In four animals,
anaesthesia was induced by first placing the animal in a closed glass box filled with 
isoflurane vapour and then giving the animal an intraperitoneal injection of 
Equithesin (pentobarbital and chloral hydrate; 1.0 ml per 250 g body weight).
Two animals (15272 and 15273) were anaesthetized with isoflurane (induction 
chamber level of 4.0% while the rats were secured in the stereotaxic apparatus, with 
an air flow at 1,400 ml min⁻¹; isoflurane was then gradually reduced to 1-2%
during the course of the surgery). Supplementary anaesthesia was given when 
breathing and reflexes changed. Local anaesthetic (Xylocain) was applied on the 
skin before making the incision. The hyperdrive was then implanted. The tetrodes 
were inserted above CA3 of the right hippocampus, with the centre of the bundle at 
anterior-posterior 3.8 mm and medial-lateral 3.0 mm relative to bregma.
Jewellers’ screws and dental cement were used to secure the hyperdrive to the 
skull. Two screws were connected to hyperdrive ground. All tetrodes were turned 
after the surgery to be sure they were in the brain. 

Tetrode positions. Over the course of ~3-4 weeks, the majority of the tetrodes 
were lowered towards CA3 in steps of 50 µm or less while the rat rested on a towel in
a large flower pot on a pedestal. Turning was slowed down when large-amplitude 
theta-modulated complex-spike activity appeared in CA3 at depths of approxi- 
mately 3.0 mm. The tetrode depths were tweaked to get the maximal number of 
simultaneously recorded CA3 cells at the start of cue switching. To maintain stable 
recordings, the electrodes were not moved at all on the day of recording. A few 
tetrodes were left in CA1; data from these tetrodes were analysed separately. Two of 
the tetrodes were used, respectively, to record a reference signal from the corpus 
callosum and an EEG signal from the stratum lacunosum-moleculare. 
Behavioural training procedures. The rats were trained to collect food morsels in 
either of two distinct 60 cm × 60 cm enclosures with 40-cm walls located in a dark
curtained environment (Supplementary Fig. 2). The boxes rested on a plexiglass 
plate fixed 10 cm above the floor of the room. The boxes were identical except for 
the arrangement of a number of internal lights. Beneath the plexiglass plate of box 
A there was a panel of eight light-emitting white diodes (LEDs) organized into a 
circle (50 cm in diameter) and placed centrally under the plexiglass plate to be
visible through the floor. The box was polarized by another LED at the upper edge 
of one of the walls. Box B was illuminated by a 60-cm-long array of green LEDs 
lining 40 cm of the upper edge of the wall opposite to the directional LED in A and 
20 cm of one of the adjacent walls (Supplementary Fig. 13a). The LEDs were the
only light source in the room. 

Training occurred in four stages. During stage 1, the boxes were located next to 
each other, connected by a 20 × 20 cm (width × length) passageway that allowed
the rat to shuttle between the boxes in order to associate each box with a different
set of path integrator coordinates. The rat was permitted to travel freely between
the boxes for 20 min on at least three trials. Trials were separated by 20 min 
intervals, during which the animal rested on a towel in a pedestal outside of the 
curtains. At stage 2, the corridor was removed and the animal explored the boxes 
individually on alternating trials (3 trials in each). At stage 3, the boxes were 
replaced by a single box made of the same material and equipped with both sets 
of lights. The box was placed on alternating trials at the two original locations. 
When presented at the original position of box A, the set of LEDs defining A was 
switched on; when in the place of box B, the respective lights of B were active 
instead. Again, the rat was tested 3 times in each environment on alternating 
occasions. Finally, at Stage 4, the box was moved to a central location between 
the two original box locations. The animal received alternating trials with each set 
of lights (2 consecutive days, each day 3 pairs of trials). During all stages, at the start 
of each trial, the rat was taken from the flower pot on the pedestal outside of the 
curtains and placed, without disorientation, into the environment with the eyes 
gently covered by the experimenter’s palm. Between trials, the rat rested for 20 min 
in the flower pot. During this period, the boxes were thoroughly washed with a wet 
tissue and dried. 
On the test day, the rat started with a 20 min rest trial in the flower pot. The
animal was then tested for 10 min in each box configuration (A and B, respec- 
tively). Then, during the third trial, after 40-60 s of baseline recording in one of the 
configurations (for example, A), the lights were switched instantaneously to the 
other configuration (for example, B). Additional ‘teleportations’ were performed 
every subsequent 40-60 s until 10 min had passed. A resting trial in the flower pot 
was recorded at the end. The electrodes were then lowered deeper into the CA3 
and the experiment was repeated on the subsequent day in those cases where 
spikes with sufficient amplitudes from new cells could be recorded. 

Running was motivated by small crumbles of cookies thrown into the box at 
10-20 s intervals. Three types of cookies were used: vanilla, chocolate and unfla-
voured. Vanilla and unflavoured were given in configuration A, chocolate and 
unflavoured in configuration B. The proportion between flavoured and unfla- 
voured cookies was approximately 1:1 during stage 1 and 1:4 during the following 
stages. During teleportation trials, only unflavoured crumbles were offered. 
Population vector analyses. Theta waves were identified from filtered local EEG 
traces as described in the Supplementary Methods. The evolution of unit activity 
over successive theta cycles before and after cue switching was estimated by 
defining a population vector for each theta cycle, consisting of the number of 
spikes fired by each cell in the array, and comparing it to a population vector 
for the same cells at the corresponding location in each of the reference environ- 
ments (Fig. 1a, c). Boundaries between theta cycles were determined by plotting, 
for all cells on the entire teleportation session, the number of spikes as a function of 
theta phase (Fig. 1b). The boundary between successive theta cycles was then 
defined by the theta phase with the lowest overall firing rate (bins of 10 deg; 
Fig. 1b, c). For each theta cycle, the momentary population vector was correlated 
with reference population vectors for the same cell sample at the same location in 
each of the reference environments. The reference vector (expressed in Hz rather 
than as a spike count, normalized by the time spent at each location) was based on 
activity across the entire reference trial for each of the environments on the test 
day. In a subset of the analyses, the momentary population vector was compared 
with all reference population vectors in the two boxes, that is, not only those 
corresponding to the animal’s current location. Correlations between momentary 
population vectors and reference vectors were quantified by dot products as well as 
Pearson product-moment correlations. Dot products were calculated for all theta 
cycles, including those with no activity, where the dot product is zero. Pearson 
correlations, which cannot be defined for theta cycles with no spikes, were calcu- 
lated by excluding also theta cycles with a single unit active, that is, these correla- 
tions were based on the subset of theta cycles that included at least two active cells. 
In a subset of the analyses, also cycles with only two active units were excluded, and
in a further control, also those with only three active units were discarded 
(Supplementary Fig. 7). 
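The steps in this paragraph can be illustrated with a minimal Python sketch. It is not the authors' analysis code: the function names (`lowest_rate_phase`, `dot_product`, `pearson_if_active`) and the plain-list data layout are assumptions made for illustration. The sketch finds the theta phase bin with the fewest spikes (10-deg bins), computes the un-normalized dot product divided by the number of cells, and returns a Pearson correlation only when at least two cells are active.

```python
import math

def lowest_rate_phase(spike_phases_deg, bin_width=10):
    """Left edge (deg) of the theta-phase bin that contains the fewest spikes."""
    n_bins = 360 // bin_width
    counts = [0] * n_bins
    for phase in spike_phases_deg:
        # Wrap the phase into [0, 360) and guard against float edge cases.
        counts[int(phase % 360 // bin_width) % n_bins] += 1
    return counts.index(min(counts)) * bin_width

def dot_product(pop_vec, ref_vec):
    """Un-normalized dot product, divided only by the number of cells C."""
    return sum(p * r for p, r in zip(pop_vec, ref_vec)) / len(pop_vec)

def pearson_if_active(pop_vec, ref_vec, min_active=2):
    """Pearson correlation, or None if fewer than min_active cells fired."""
    if sum(1 for x in pop_vec if x > 0) < min_active:
        return None
    n = len(pop_vec)
    mean_p = sum(pop_vec) / n
    mean_r = sum(ref_vec) / n
    cov = sum((p - mean_p) * (r - mean_r) for p, r in zip(pop_vec, ref_vec))
    var_p = sum((p - mean_p) ** 2 for p in pop_vec)
    var_r = sum((r - mean_r) ** 2 for r in ref_vec)
    if var_p == 0 or var_r == 0:
        return None  # correlation is undefined for a constant vector
    return cov / math.sqrt(var_p * var_r)
```

Raising `min_active` to 3 or 4 corresponds to the stricter control thresholds mentioned above.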

To determine if the similarity between momentary population vectors during 
cue switching and population vectors in the reference environment was larger than 
expected by chance, we correlated population vectors from successive theta cycles 
in reference recordings from A and B with mean population vectors generated 
from activity in the same position across the entire session in the same environ- 
ment or the alternative environment. When referenced to the same environment 
(A × A or B × B), the distribution of Pearson correlations was centred at high
correlation values with a long tail of low values (Supplementary Fig. 9, top row).
When referenced to the alternative environment (A × B or B × A), the distributions
were centred at slightly negative values with a long tail towards infrequent
high positive values (Supplementary Fig. 9, bottom row). We then determined 5th 
and 95th percentile values for each distribution and used these as criteria to 
identify flickers to the alternative representation. Individual theta cycles were 
defined as A-correlated if the correlation with reference environment A was above 
the 95th percentile for B × A (that is, more similar to A than 95% of the theta cycles
in B) and if the correlation with reference environment B was below the 5th
percentile for B × B (that is, more different from the reference in B than most
theta cycles in B). Conversely, theta cycles were defined as B-correlated if the
correlation with reference environment B exceeded the 95th percentile for
A × B and if the correlation with reference environment A was lower than the
5th percentile for A × A. In separate analyses, the momentary population vectors
were compared not only with reference vectors at the same location but at all 900 
bin locations in the box (Supplementary Fig. 8). 
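The percentile criteria described in this paragraph can be expressed compactly in code. The following Python sketch is illustrative only: the helper names, the threshold dictionary keys and the linear-interpolation percentile are assumptions, not details given in the text.

```python
def percentile(values, q):
    """Percentile (q in 0..100) of a non-empty list, by linear interpolation."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)

def classify_cycle(corr_a, corr_b, thresholds):
    """Label one theta cycle 'A', 'B' or None from its two Pearson correlations.

    thresholds holds four cut-offs taken from the reference sessions:
      'BxA_95': 95th percentile of B cycles correlated with the A reference
      'BxB_05': 5th percentile of B cycles correlated with the B reference
      'AxB_95', 'AxA_05': the mirror-image values for A cycles
    """
    if corr_a > thresholds['BxA_95'] and corr_b < thresholds['BxB_05']:
        return 'A'
    if corr_b > thresholds['AxB_95'] and corr_a < thresholds['AxA_05']:
        return 'B'
    return None  # neither criterion met
```

In use, each threshold would be obtained by applying `percentile` to the corresponding distribution of reference-session correlations, for example `percentile(b_cycles_vs_a_reference, 95)` for `'BxA_95'` (the variable name is hypothetical).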

The abundance of theta cycles that correlate with both reference environments 
(‘mixed’ theta cycles) was determined by comparing the data with spike patterns 
obtained by ‘shuffling’ the activity of individual units. For each theta cycle during 
the 10 s preceding and succeeding the cue switching, the number of spikes produced
by each unit recorded during the cue switching was drawn at random from
among all theta cycles recorded in the same physical location in the same environ- 
ment during the corresponding period (most of these theta cycles were correlated 
with the current environment, some with the alternative one). The analysis was 
limited to theta cycles for which a minimum of 3 theta cycles had been recorded
in that particular location in the relevant period, with ‘same’ location defined as the 
same 6 cm × 6 cm spatial bin of the same recording box. The shuffled population
vectors can be conceived as approximating the linear combination
$r_s \approx A \cdot r_A + B \cdot r_B$, where A and B = 1 − A are random binary vectors (for
example, (0,1,0,0,1,0,1,1,...)) indicating whether each unit was drawn from the
A or B representation of that location. If the current environment is A, a particular
theta cycle in the original data can express activity close to the reference vector A in 
that location, or to the reference vector B (for example, during a flicker event) or to 
a mixture of the two, or just noise or, rather frequently, no activity at all. The 
abundance of mixed theta cycles was determined by the un-normalized dot product
DP of the population vector in each theta cycle with the reference vectors in A
and B, $DP_A = r \cdot r_A / C$ and $DP_B = r \cdot r_B / C$ (just divided by the number C of simultaneously
recorded units). DP values range from 0 to about 10 (note that they are
expressed in Hz, as the momentary population vector includes spike counts 
whereas the reference vector indicates firing rates), and very rarely beyond, but 
are mostly clustered close to zero because of the presence of theta cycles with no 


spikes or with, for example, a single spike from a unit that emits on average less 
than 1 spike in that physical location in both reference vectors. We arbitrarily set 
DP > 1 as the criterion for substantial correlation, and DP > 0.5 for half-cycles. 
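The shuffling comparison can be sketched as follows. This is a simplified illustration rather than the authors' procedure: it treats `cycles` as the spike-count vectors of all theta cycles recorded in a single spatial bin, so one pair of reference vectors applies, and every function name is hypothetical. Each shuffled cycle draws the spike count of each unit independently from a randomly chosen recorded cycle, and the test counts how many shuffles contain more mixed cycles (DP > 1 for both references) than the observed data.

```python
import random

def dp(pop_vec, ref_vec):
    """Un-normalized dot product divided by the number of units C."""
    return sum(p * r for p, r in zip(pop_vec, ref_vec)) / len(pop_vec)

def is_mixed(pop_vec, ref_a, ref_b, criterion=1.0):
    """True if the cycle correlates substantially with both references."""
    return dp(pop_vec, ref_a) > criterion and dp(pop_vec, ref_b) > criterion

def count_mixed(cycles, ref_a, ref_b, criterion=1.0):
    return sum(is_mixed(c, ref_a, ref_b, criterion) for c in cycles)

def shuffle_cycles(cycles, rng):
    """Rebuild each cycle unit by unit from independently chosen cycles."""
    n_units = len(cycles[0])
    return [[rng.choice(cycles)[u] for u in range(n_units)] for _ in cycles]

def shuffle_test(cycles, ref_a, ref_b, n_shuffles=1000, seed=0):
    """Return (observed mixed count, number of shuffles with more mixed cycles)."""
    rng = random.Random(seed)
    observed = count_mixed(cycles, ref_a, ref_b)
    more_mixed = sum(
        count_mixed(shuffle_cycles(cycles, rng), ref_a, ref_b) > observed
        for _ in range(n_shuffles)
    )
    return observed, more_mixed
```

A result such as 970 shuffles out of 1,000 with more mixed cycles than observed corresponds to the P < 0.03 reported in the main text. For half-cycles the `criterion` argument would be set to 0.5.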
The speed of instantaneous transitions from one representation (for example, A) 
to the other (for example, B) was estimated by cross-correlating the sequence of 
correlations with reference environment A and the sequence of correlations with B 
one time-bin later, excluding, in this analysis, units with overlapping fields in the two 
environments and trials with fewer than 20 non-overlapping units (2 out of 11;
Supplementary Fig. 3). Bin widths were then varied from 44 ms to 500 ms to deter- 
mine the transition time that gave the largest cross-correlation. Bins were also 
defined by actual theta cycles, with separations at the point of minimum spike 
activity in the population (Fig. 1b, c) and at other phases 10 deg apart (Supplemen- 
tary Fig. 3). Cross-correlation values were corrected for overlap between A and B 
representations by subtracting the cross-correlation at zero time lag. 
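One way to picture the lagged cross-correlation is the short Python sketch below. It rests on assumptions the text does not spell out: the cross-correlation is taken here as a Pearson coefficient between the two sequences, and the function names are hypothetical. The score correlates the A-correlation in each bin with the B-correlation one bin later and subtracts the zero-lag value.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if undefined)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)

def transition_score(corr_a, corr_b):
    """Correlation of corr_a[t] with corr_b[t + 1], minus the zero-lag value."""
    lag_one = pearson(corr_a[:-1], corr_b[1:])
    lag_zero = pearson(corr_a, corr_b)
    return lag_one - lag_zero
```

Repeating the calculation for sequences binned at different widths (44 ms to 500 ms, or by actual theta cycles) would then show which segmentation yields the largest score.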
Other. Recording procedures, criteria for spike sorting, construction of rate maps, 
analysis of theta rhythm and histological procedures are described in Supplemen- 
tary Materials and Methods. 
Corridors of migrating neurons in the human brain
and their decline during infancy 


Nader Sanai, Thuhien Nguyen, Rebecca A. Ihrie, Zaman Mirzadeh, Hui-Hsin Tsai, Michael Wong, Nalin Gupta,
Mitchel S. Berger, Eric Huang, Jose-Manuel Garcia-Verdugo, David H. Rowitch & Arturo Alvarez-Buylla


The subventricular zone of many adult non-human mammals 
generates large numbers of new neurons destined for the olfactory 
bulb. Along the walls of the lateral ventricles, immature neuronal
progeny migrate in tangentially oriented chains that coalesce into a 
rostral migratory stream (RMS) connecting the subventricular zone 
to the olfactory bulb. The adult human subventricular zone, in 
contrast, contains a hypocellular gap layer separating the ependymal
lining from a periventricular ribbon of astrocytes. Some of
these subventricular zone astrocytes can function as neural stem
cells in vitro, but their function in vivo remains controversial. An
initial report found few subventricular zone proliferating cells and
rare migrating immature neurons in the RMS of adult humans. In
contrast, a subsequent study indicated robust proliferation and
migration in the human subventricular zone and RMS. Here we
find that the infant human subventricular zone and RMS contain an
extensive corridor of migrating immature neurons before 18
months of age but, contrary to previous reports, this germinal
activity subsides in older children and is nearly extinct by adult- 
hood. Surprisingly, during this limited window of neurogenesis, not 
all new neurons in the human subventricular zone are destined for 
the olfactory bulb—we describe a major migratory pathway that 
targets the prefrontal cortex in humans. Together, these findings 
reveal robust streams of tangentially migrating immature neurons 
in human early postnatal subventricular zone and cortex. These 
pathways represent potential targets of neurological injuries affect- 
ing neonates. 

We collected human brain specimens from 10 neurosurgical re- 
sections and 50 autopsied brains, ranging in age from birth to 84 years 
(Supplementary Fig. 1 and Supplementary Table 1), using a protocol 
that allows subsequent analysis by fluorescent immunohistochemistry 
and in situ hybridization (see Methods). As shown (Fig. 1), staining of 
horizontal sections (30 µm) through the anterior horn of the lateral
ventricle demonstrates that, in the first 6 months of life, the structure of 
the human subventricular zone in infants differs considerably from 
that observed in adults. The astrocytic ribbon and gap layer are not 
evident (Fig. 1) and, as seen in the fetal human brain, cells with
elongated radial glial processes line the lateral ventricular wall and
express vimentin and glial fibrillary acidic protein (GFAP). Adjacent
to these radial glia, we observed a dense network of elongated unipolar
and bipolar cells oriented tangentially to the ventricular lining. Many of
these cells expressed the immature neuronal markers doublecortin
(DCX) (Fig. 1) and β-III tubulin (TUBB3). Some putative immature
neurons also expressed polysialylated neural cell adhesion molecule 
(PSA-NCAM), which is present in migratory cells. They also had ultra- 
structural features of immature migrating neurons (Supplementary 
Fig. 2) similar to neuroblasts described in the rodent subventricular 
zone. Progressively, between 6 and 18 months of age, the subventricular
zone is depleted of this dense network of putative migratory neurons 


and adopts the characteristic adult structure with an astrocyte ribbon 
and hypocellular gap layer. The emergence of the gap layer coincides 
with the decline in DCX⁺ immature neurons (25-fold during the first 6
months; n = 16, ages 0-17 years; Fig. 1) and proliferation, suggesting 
that human subventricular zone neurogenesis decreases drastically 
during the first 6 months of life. Only a small number of proliferating 
cells were present in adolescents and adults. Expression of the proliferation
marker Ki67 was not associated with pyknotic nuclei,
although we cannot exclude that some Ki67⁺ nuclei correspond to
apoptotic cells induced by ischaemia. We also identified a subpopulation
of epidermal growth factor receptor (EGFR)-positive cells, a marker
associated with early progenitors including neural stem cells and
transit-amplifying cells in mice, which similarly diminished with time
(Supplementary Fig. 3). A subset of EGFR⁺ cells expressed Ki67, while
others co-localized with DCX or PSA-NCAM, potentially representing
transitional stages from transit-amplifying cells to immature neurons.
Whole-mount en face preparations of the subventricular zone also confirmed
massive numbers of tangentially oriented DCX⁺ chains within
the gap layer at 1 week and 2 months of life. In stark contrast, cells with 
the morphology and marker expression of immature migratory neurons 
were extremely rare in adults (Supplementary Fig. 4). 

Our results indicate that robust streams of tangentially migrating 
immature neurons initially populate the postnatal human gap layer; 
these pathways become depleted between 6 and 18 months and this region
transitions into a hypocellular gap. Previous work has also demon- 
strated a similarly sharp decline in EGFR and PSA-NCAM immuno- 
reactivity in the human subventricular zone during the first year of 
life. Within the subventricular zone, the decline of migratory immature
neurons appears first along the posterior third of the lateral ventricle
and then progresses in a posterior-to-anterior trajectory towards
the ventral tip (data not shown). After 18 months, Ki67⁺ proliferative
activity and the number of DCX⁺ immature neurons assume trace
levels seen in adults, leaving behind the gap layer characteristic of the
adult human subventricular zone.

We next investigated whether proliferation and the presence of puta- 
tively migrating, immature neurons in the subventricular zone were 
associated with an active RMS. Using autopsied material (n = 6) (ages 
1 day; 1 week; 1, 3 and 6 months), serial sagittal and coronal recon- 
struction of the ventral forebrain revealed an uninterrupted column of 
cells that connected the ventral tip of the subventricular zone to the 
olfactory peduncle (Fig. 2). The descending/proximal limb of the pae- 
diatric RMS contained cells organized as chains—large collections of 
elongated immature neurons expressing DCX and surrounded by glial 
cells and processes'*'*'? (Fig. 2)—or as broad streams of individual cells. 
A subpopulation of these DCX+ cells expressed PSA-NCAM. Conversely, 
analysis of tissue from older children (n = 7; 2, 3, 7, 16, 17 years) 
failed to reveal chains of migrating cells or evidence of an active 
RMS. However, individual or pairs of elongated DCX+ PSA-NCAM+ 
Figure 1 | Cytoarchitectural development of the human subventricular zone 
during the first 18 months of life. a–q, Illustrations localizing the horizontal 
30-µm sections of the anterior ventral subventricular zone (a, b) labelled with 
nuclear marker DAPI (c, f, i, l, o), the immature migrating neuronal marker DCX 
(d, g, j, m, p) and the immature glial marker vimentin (e, h, k, n, q). F, frontal 
lobe; LV, lateral ventricle; OB, olfactory bulb; T, temporal lobe. Large numbers 
of DCX+ immature migrating neurons populate the region that develops into 
the human gap layer (yellow dotted lines). e, h, k, n, q, In the first few months of 
life, radial glia-like vimentin+ cells are seen to populate the ventricular lining 
(e, h), and at 12 months onward, a dense network of vimentin processes fills 
the gap area (n, q). DCX and DAPI are co-labelled in the same section (first and 
second columns), whereas vimentin staining is from adjacent sections (third 
column). r, DCX+ immature neurons within the anterior ventral 
subventricular zone were quantified in ten ×20 fields per section, ten sections 
per specimen, and 1-3 specimens per time point. Each data point represents 
the mean ± s.d. of a single specimen at the designated age (m, month; w, week; 
y, year). Inset demonstrates the field of quantification (yellow box) for each of 
the ten horizontal sections per specimen. s, Proliferating Ki67+ subependymal 
cells were similarly quantified along the anterior ventral human subventricular 
zone. Taken together, these data demonstrate, during the first 18 months of life, 
a sharp decline in proliferating cells and immature migrating neurons 
coinciding with emergence of the human subventricular zone gap layer. Scale 
bars, 30 µm. 
Figure 2 | The infant human RMS connects the subventricular zone to the 
olfactory peduncle. a, b, Illustrations indicating the anatomical site, plane and 
orientation of tissue sectioning. F, frontal lobe; LV, lateral ventricle; 
OB, olfactory bulb; T, temporal lobe. c, Reconstruction of serial sagittal sections 
of a haematoxylin-stained RMS from inferior lateral ventricle (V) to olfactory 
peduncle (OP) of an infant specimen (5 months of age). The subventricular 
source of the proximal RMS shows multiple tributaries of cells (yellow arrows) 
that appear to coalesce forming the descending (proximal) limb of the RMS. 
There is no evidence of a continuous ventricular system, but isolated 
ependymal islets (yellow arrowheads) are observed along the RMS. The distal 
limb of the RMS moves laterally, out of the sagittal plane of sectioning, as 
indicated by the yellow section frames. Yellow box (C) indicates region of 
proximal limb of RMS shown at high resolution in d. d, Clusters of DCX+ 
PSA-NCAM+ immature neurons at higher magnification. e, Individual DCX+ 
neuronal chains, seen here in the ventral tip of the anterior horn of a 1-week 
specimen, are typically embedded in a matrix of GFAP+ glial processes and 
often migrate along blood vessels (see Supplementary Fig. 4d). f, Ultrastructural 
analysis of a neuronal chain cross-section identifies immature neurons (type A 
cells, labelled ‘A’) surrounded by glia (type B cells, labelled ‘B’). Scale 
bars, 500 µm (c), 50 µm (d), 10 µm (e) and 2 µm (f). 


putative migratory neurons were occasionally noted in late childhood 
specimens (n = 2; 3 and 7 years) and adult specimens (n = 5, ages 30, 
41, 61, 74, 84 years). These observations indicate that, whereas the 
infant brain contains a robust RMS with massive chain migration, 
such activity is reduced markedly in older children and adults’. 
Although our data do not rule out that rare immature neurons may 
sporadically migrate within the subventricular zone and RMS at later 
stages””, these findings do not support the previous finding of robust 
proliferation and abundant migration within the adult human subven- 
tricular zone’. 
The distal limb of the RMS delivers subventricular zone neuronal 
progeny to the human olfactory peduncle, and then to the olfactory 
tract and olfactory bulb. Serial cross-sections of the olfactory tract 
(n = 6; 1 day; 1 week; 5, 6 and 8 months) revealed clusters of 
DCX+ PSA-NCAM+ immature neurons within the V-shaped central 
core (Fig. 3). Electron microscopy analysis of the 6-month olfactory 
tract (n = 3) confirmed ultrastructural features of migrating immature 
neurons within the core. Cross-sections of the proximal olfactory tract 
core (5 mm anterior to the olfactory peduncle) at 8 months contained 
234 ± 26 total cells and 173 ± 24 DCX+ cells per section, respectively, 
while the distal olfactory tract core (5 mm posterior to the olfactory 
bulb) contained 216 ± 20 total cells and 115 ± 14 DCX+ cells. In 
contrast, DCX+ neurons were not detected in older olfactory tract 
specimens (n = 3; 18 months, 7 and 13 years) (Fig. 3). These data 
further support that an active RMS exists in early childhood, but is 
greatly reduced after 18 months of age. 

Although it has been suggested that an open olfactory ventricle 
persists into adult life’, we found no evidence of a ventricular extension 
in the olfactory tract at any of the ages studied, consistent with previous 
data indicating that the olfactory ventricle fuses before birth”°”°. Along 
the proximal limb of the RMS, we observed ependymal islets, but no 
evidence of a continuous open ventricle. These displaced and discon- 
tinuous islets were lined by multiciliated cuboidal cells expressing the 
ciliary marker acetylated-tubulin (Supplementary Fig. 5). In all studied 
specimens (n = 23), the anterior horn of the lateral ventricle was open, 
but no continuous, ependymal-lined lumen extending into the RMS or 
olfactory tract was observed. Thus, we infer that the human olfactory 
tract serves as a conduit for neuronal chain migration to the olfactory 
bulb, but these chains of migratory cells are only evident in infants and 
occur in the absence of a ventricular extension. 

In tracing the ependymal islets of the paediatric human RMS with 
serial coronal reconstructions (n = 3), we noted a decrease in calibre of 
the RMS from the proximal to distal limb (Fig. 4). On the basis of 
quantification of 4',6-diamidino-2-phenylindole (DAPI) nuclear 
staining, the proximal limb of the RMS contained 548 ± 66 total cells 
per cross-section and 485 ± 58 DCX+ immature neurons per cross- 
section. In the distal limb, however, there were 228 ± 24 total cells per 
cross-section and 189 ± 17 DCX+ immature neurons per cross- 
section, equating to a 58% decline in total cells and a 61% decline in 
immature neurons. The decreasing calibre of the RMS could be due to 
cell death, increased migratory speed, or immature migratory neurons 
taking alternative paths. TdT-mediated dUTP nick end labelling 
(TUNEL)-positive cells were present in the proximal and distal 
RMS, raising the possibility that apoptosis could contribute to this 
decline. However, serial coronal reconstruction of the frontal lobe also 
unexpectedly revealed an additional migratory stream of DCX+ cells 
branching off the proximal limb of the RMS and ending in the ven- 
tromedial prefrontal cortex (VMPFC) (Fig. 4). This medial migratory 
stream (MMS) was observed in human specimens aged 4-6 months 
but not 8-18 months. Similar to the RMS, the MMS contains large 
clusters of DCX+ and PSA-NCAM+ cells with elongated morphologies, 
some adjacent to discontinuous ependymal islets (Supplementary 
Fig. 5). Early time points (<1 month) also revealed a more diffuse pattern 
of medially oriented migratory neurons emanating from the proximal 
limb of the primary RMS (Supplementary Fig. 6). Interestingly, cells 
expressing DCX, PSA-NCAM and the interneuron markers calretinin 
(CalR) and tyrosine hydroxylase (TH), were observed not only within 
this MMS but also within a restricted subregion of the VMPFC (Fig. 4). 
In contrast, very few DCX+, PSA-NCAM+, CalR+ and TH+ cells were 
evident in adjacent areas of prefrontal cortex. Although it remains 
possible that alterations in immunoreactivity allow progeny to escape 
into adjacent regions undetected, these observations suggest that the 
MMS diverts immature subventricular zone neurons to target the 
VMPFC. 
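The proximal-to-distal decline quoted above can be checked with a few lines of arithmetic. The sketch below uses only the mean counts per cross-section reported in the text; the function name `percent_decline` is a hypothetical helper introduced for illustration, and the reported errors are ignored.

```python
def percent_decline(proximal: float, distal: float) -> float:
    """Percentage drop from the proximal value to the distal value."""
    return (proximal - distal) / proximal * 100.0


# Mean cells per cross-section as reported in the text: (proximal, distal).
total_cells = (548, 228)
dcx_cells = (485, 189)

total_decline = percent_decline(*total_cells)
dcx_decline = percent_decline(*dcx_cells)

print(f"Total cells: {total_decline:.0f}% decline")
print(f"DCX+ cells:  {dcx_decline:.0f}% decline")
```

Rounded to whole percentages, these values agree with the 58% and 61% declines stated in the text.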

A comparable MMS has not been reported in other vertebrates. We 
analysed serial sections of mouse brains at postnatal day (P)4, P8, P16 
and P20 and did not detect a MMS (Supplementary Fig. 7). However, 
some individual DCX+ cells were observed migrating ventrally and 
laterally in the juvenile brains studied that might target homologous brain 
regions. In rodents and non-human primates, migratory neurons can 


Figure 3 | Postnatal development and decline of the RMS in the human 
olfactory tract. a, c–e, h–j, Illustration (a) localizes the anatomical sites for 
cross-sectional reconstructions of the proximal human olfactory tract at 1 day 
(c, h), 8 months (d, i) and 7 years (e, j). LV, lateral ventricle; OB, olfactory bulb. 
High-magnification fields (h–j) demonstrate progressive loss of DCX+ 
PSA-NCAM+ immature neurons in the olfactory tract. An active RMS is evident at 
birth and 8 months of age, with many DCX+ PSA-NCAM+ cells. By 7 years, 
however, chains of migrating cells are not observed and DCX+ cells are rare. 
b, Toluidine blue staining of a longitudinal semi-thin (6-µm) section from a 
6-month olfactory tract demonstrates a central chain of darkened immature 
neurons surrounded by a matrix of lighter-coloured astrocytes. f, g, Electron 
microscopy also demonstrates clusters of type A cells (labelled ‘A’) enmeshed 
with type B cells within the olfactory tract core, including intercellular junctions 
(yellow arrows) between immature neurons characteristic of chain migration 
(f) and intermediate filaments (yellow arrowheads) within surrounding type B 
cells (labelled ‘B’; g). Scale bars, 500 µm (c–e); 75 µm (h–j); 10 µm (b); and 
0.5 µm (f, g). 
Figure 4 | A MMS of immature neurons branches from the proximal RMS 
in the infant human brain to supply the VMPFC. a, b, Illustrations 
demonstrate the anatomical site of the RMS (blue) and MMS (red). F, frontal 
lobe; LV, lateral ventricle; OB, olfactory bulb; T, temporal lobe. c, Using cross- 
sections of the proximal and distal RMS, quantifications of the total number of 
cells and total number of DCX+ cells per cross-section indicated a significant 
reduction from proximal to distal RMS. Asterisks indicate P < 0.05. Data are 
shown as mean ± s.e.m. d, Coronal reconstruction of a 6-month specimen 
reveals a PSA-NCAM+ MMS diverging from the RMS to reach the VMPFC. 
e, f, Within this MMS, chains of PSA-NCAM+ cells (e) co-express the 
immature neuronal marker DCX (f). g–l, Dense clusters of PSA-NCAM+ 
(g), calretinin+ (i) and TH+ (k) cells are specifically observed within a 
subregion of the VMPFC, but not at adjacent cortical sites (h, j, l) superior or 
inferior to this subregion. Scale bars, 150 µm (d), 20 µm (e–l). 


also escape the subventricular zone along a sagittal plane to reach the 
islands of Calleja*!”’. 

Our study indicates that the region of the subventricular zone 
around the anterior lateral ventricles in the infant human brain is 
highly active, producing many tangentially migrating immature neu- 
rons. On the basis of the presence of an RMS containing chains of 
immature neurons, we infer that at least some of these progeny are 
destined for the olfactory bulb. Beyond 18 months of age, both prolif- 
erative activity and cells expressing markers of immature neurons are 
largely depleted, coinciding with the appearance of a hypocellular gap 
in the postnatal human subventricular zone. Thus, this layer of the 
subventricular zone initially serves as a thoroughfare for immature 
neurons. Surprisingly, paediatric human subventricular zone neuro- 
genesis also seems to serve regions other than the olfactory bulb, as 
evidenced by medially escaping immature neurons near the RMS. 
These groups of cells form a unique MMS targeting a subregion of 
the human prefrontal cortex. 

Previous work has suggested that postnatal neurogenesis may be 
important for learning and memory” and that induction of plasticity 
may be closely linked to the timing of neuronal maturation” ””. We 
speculate that the MMS, and other potential escape pathways from 
the subventricular zone, could supply interneurons to regions of 
the developing human brain as a mechanism of delayed postnatal 
plasticity. Although the function of this recipient cortical domain, the 
VMPFC, is unknown in children, this region in the adult human brain 
is activated during specific cognitive tasks”*”° including spatial concep- 
tualization and the emotional processing of visual cues. Interestingly, 
the VMPFC is also focally inactivated in patients with advanced 
Alzheimer’s disease*®. Beyond its functional implications, this develop- 
mental study of the human subventricular zone suggests a major period 
of neurogenesis and neuronal migration that extends well into post- 
natal life, but is largely limited to early childhood. This may hold 
important implications for our understanding of neonatal neurological 
diseases, including germinal matrix haemorrhages and perinatal 
hypoxic-ischaemic injuries, each potentially altering subventricular 
zone neurogenesis and its apparent downstream cortical targets at 
formative stages of human development. Perhaps most importantly, 
the detection of a new migratory route for immature neurons within the 
infant human brain also highlights mechanisms through which 
increased regional complexity may be achieved during brain evolution. 
Note added in proof: A recent study*' has found the RMS at fetal stages, 
but very few DCX-positive neurons in this same region in the adult 
human brain. 


METHODS SUMMARY 


Human specimens. Neurosurgical excisions of normal subventricular zone 
occurred as part of the planned margin of resection surrounding a periventricular 
lesion (Supplementary Table 1). Intra-operative specimens were histologically 
normal with no evidence of dysplasia, and assessments were confirmed 
by an independent neuropathologist. For pathological specimens, autopsied 
brains were cut coronally at the mammillary bodies and immersed in 4% para- 
formaldehyde (PFA) for 1-2 weeks at 4°C, and then stored in 0.1M PBS. All 
specimens were collected with informed consent and in accordance with the 
University of California San Francisco Committee on Human Research (IRB no. 
H11170-19113). 

Rodent specimens. Postnatal mice at the specified ages were transcardially per- 
fused with 0.9% saline and 4% PFA, and dissected brains were post-fixed for 30 min 
in 4% PFA before vibratome sectioning. One-hundred-micrometre floating sections 
were stained and imaged using the tile scanning and three-dimensional projection 
modules on a Leica SP5 confocal microscope. 

Immunohistochemistry. Tissue sections were incubated with primary antibodies 
diluted overnight at 4°C. Sections were then incubated for 2.5h in secondary 
antibodies and then incubated in streptavidin-horseradish peroxidase for 
30 min. Antigen retrieval with proteinase K (10 µg ml⁻¹) or 0.01 M citrate buffer 
at 95 °C was used when necessary. 

Whole-mount dissection. Specimens were fixed in 4% PFA for 3 days, then rinsed 
in 0.1M PBS. The anterior horn of the lateral ventricles was excised (3mm 
squares) and rinsed. Tissue specimens were incubated in primary antibodies 
diluted in blocking solution for 2 nights. This step was repeated for secondary 
antibodies incubation. The ventricular face of tissue specimens was microdissected 
and collected at 200-300 µm thickness and mounted. 

Electron microscopy. Specimens fixed in 2% glutaraldehyde and 2% paraformaldehyde 
were cut into 200-µm sections on a vibratome. Sections were 
post-fixed in 2% osmium, rinsed, dehydrated and embedded in Araldite 
(Durcupan, Fluka). To identify individual cell types, ultra-thin (0.05-µm) sections 
were cut with a diamond knife, stained with lead citrate and examined under a Jeol 
100CX electron microscope. 

Statistical measures. Cell counts were quantified and expressed as mean ± 
standard deviation (s.d.) or standard error of the mean (s.e.m.). Statistical signifi- 
cance was determined by Student’s t-test. 
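As an illustration of the summary statistics named above, the following sketch computes the mean, sample standard deviation, standard error of the mean, and an equal-variance (Student's) t statistic using only the Python standard library. The count lists are made-up placeholder values, not data from the study, and the helper names `summarize` and `student_t` are hypothetical. Converting the t statistic to a P value needs the t distribution, for example via `scipy.stats.ttest_ind(a, b, equal_var=True)`.

```python
import math
import statistics


def summarize(counts):
    """Return (mean, sample s.d., s.e.m.) for a list of cell counts."""
    mean = statistics.mean(counts)
    sd = statistics.stdev(counts)          # sample s.d. (n - 1 denominator)
    sem = sd / math.sqrt(len(counts))      # s.e.m. = s.d. / sqrt(n)
    return mean, sd, sem


def student_t(a, b):
    """Two-sample Student's t statistic (pooled variance) and degrees of freedom."""
    na, nb = len(a), len(b)
    pooled_var = (
        (na - 1) * statistics.variance(a) + (nb - 1) * statistics.variance(b)
    ) / (na + nb - 2)
    t = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(
        pooled_var * (1 / na + 1 / nb)
    )
    return t, na + nb - 2


# Placeholder counts per cross-section (illustrative only, not study data).
proximal = [540, 610, 495, 560, 535]
distal = [230, 205, 250, 240, 215]

for label, counts in (("proximal", proximal), ("distal", distal)):
    mean, sd, sem = summarize(counts)
    print(f"{label}: mean {mean:.1f}, s.d. {sd:.1f}, s.e.m. {sem:.1f}")

t_stat, dof = student_t(proximal, distal)
print(f"t = {t_stat:.2f} with {dof} degrees of freedom")
```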


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 


Human specimens. Neurosurgical excisions of normal subventricular zone 
occurred as part of the planned margin of resection surrounding a periventricular 
lesion (Supplementary Table 1). We recorded the anatomical origin of each intra- 
operative specimen with intra-operative neuronavigation. Intra-operative specimens 
were histologically normal with no evidence of dysplasia, and assessments were 
confirmed by an independent neuropathologist. 

For pathological specimens, autopsied brains were cut coronally at the level of 
the mammillary bodies and immersed in 4% paraformaldehyde (PFA) for 1-2 
weeks, and then stored in 0.1 M PBS. The brains were then cut into 1-cm plates 
along the coronal, axial or sagittal plane. The anterior horn of the lateral ventricle 
and the ventral medial aspect, which includes the gyrus rectus and olfactory tract, 
of the frontal cortex were excised. Autopsy specimens were obtained within 12h 
of death from all individuals. All causes of death were non-neurological in origin 
and all patients had no evidence of intracranial disease. All specimens were 
collected with informed consent and in accordance with the University of 
California San Francisco (UCSF) Committee on Human Research (IRB no. 
H11170-19113). 

Rodent specimens. Postnatal male mice at the specified ages were transcardially 
perfused with 0.9% saline and 4% PFA, and dissected brains were post-fixed for 
30 min in 4% PFA before vibratome sectioning. One-hundred-micrometre float- 
ing sections were stained with rabbit anti-doublecortin (Cell Signaling), mouse 
anti-PSA-NCAM (Millipore) and DAPI (Sigma), mounted on Superfrost Plus 
slides, and imaged using the tile scanning and three-dimensional projection 
modules on a Leica SP5 confocal microscope. Composite images were assembled 
from individual ×10 fields using Adobe Photoshop. All experiments were 
approved by the UCSF Institutional Animal Care and Use Committee (approval 
no. ANO77716). 

Immunohistochemistry. All fixed specimens were rinsed in 0.1 M PBS and then 
cut on a vibratome (50 µm), or cryoprotected in 30% sucrose and then cut on a 
cryostat (30 µm). Tissue sections were incubated with primary antibodies diluted 
in TNB blocking solution (0.1 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.5% blocking 
reagent from PerkinElmer) overnight at 4°C. Sections were then incubated for 
2.5h in secondary antibodies diluted in TNB. Sections were then incubated in 
streptavidin-horseradish peroxidase for 30min in TNB, which catalyses sub- 
sequent fluorescent conversion of tyramide substrates (all from PerkinElmer). 


Antigen retrieval with proteinase K (10 µg ml⁻¹) or 0.01 M citrate buffer at 
95 °C was used when necessary. 
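For readers preparing the TNB blocking solution described above, the amounts can be worked out from the stated concentrations. This is a rough sketch under stated assumptions: the 500 ml batch volume is arbitrary, the 0.5% blocking reagent is read as weight per volume, and the Tris is weighed as the hydrochloride salt (the pH would still need adjusting to 7.5). The function `tnb_recipe` is a hypothetical helper, not part of the published protocol.

```python
# Standard molar masses in g/mol (Tris-HCl is the hydrochloride salt).
MOLAR_MASS = {"Tris-HCl": 157.60, "NaCl": 58.44}


def tnb_recipe(volume_ml: float) -> dict:
    """Grams of each component for a given volume of TNB (assumptions above)."""
    litres = volume_ml / 1000.0
    return {
        "Tris-HCl": 0.1 * litres * MOLAR_MASS["Tris-HCl"],   # 0.1 M
        "NaCl": 0.15 * litres * MOLAR_MASS["NaCl"],          # 0.15 M
        "blocking reagent": 0.5 / 100.0 * volume_ml,         # 0.5% w/v (assumed)
    }


recipe = tnb_recipe(500)  # arbitrary example batch size
for component, grams in recipe.items():
    print(f"{component}: {grams:.2f} g")
```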

The following antibodies were used in this study: GFAP (Chemicon), vimentin 
(Sigma), doublecortin (Chemicon), TuJ1 (Covance), PSA-NCAM (Genbiosys), 
Ki67 (DAKO), calretinin (Chemicon), calbindin (Chemicon), tyrosine hydroxylase 
(Chemicon) and EGFR (Upstate Biotechnology). 

Whole-mount dissection. Specimens were fixed in 4% PFA for 3 days, then rinsed 
in 0.1 M PBS. The anterior horn of the lateral ventricles was excised (3 mm squares) 
and rinsed in 0.1% PBS-Triton X-100. Tissue specimens were incubated in primary 
antibodies diluted in blocking solution (10% normal goat serum, 2% Triton X-100, 
0.1 M PBS) for 2 nights. This step was repeated for secondary antibody incubation. 
DAPI was used (1:500) for counterstaining and Aqua Polymount (Polysciences) 
for mounting. The ventricular face of tissue specimens was microdissected and 
collected at 200-300 µm thickness and mounted. 

In situ hybridization. Probes specific to human GFAP, doublecortin and EGFR 
were generated by amplifying fragments from commercially available cDNA 
clones (Open Biosystems) or reverse-transcribed cDNA from human fetal brain 
total RNA (Clontech). Primer sequences for amplifying GFAP (Allen Human 
Brain Atlas primer ID398603) and doublecortin (primer ID399287) were obtained 
from Allen Institute for Brain Science (http://www.brain-map.org/). Forward 
primer 5’-AGCTCTTCGGGGAGCAGCGA-3’ and reverse primer 
5’-TGCACGTGGCTTCGTCTCGG-3’ were used for EGFR. Amplified fragments were 
subcloned into pCRII (Invitrogen) and RNA probes were made subsequently using 
DIG RNA labelling (Roche). Sections were cut at 14 µm and mounted on Superfrost 
Plus slides. The sections were pre-hybridized for 2 h and hybridized with probes at 
1:100 to 1:200 dilutions at 55 °C overnight. After hybridization, slides were washed 
and incubated with anti-DIG antibody, and developed by BM purple substrate for 
24 to 48 h. Slides hybridized with GFAP and doublecortin RNA probes were 
subsequently immunostained with GFAP and doublecortin antibodies as previ- 
ously described. 
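A quick sanity check of the EGFR primer pair given above can be sketched as follows. The two sequences are copied from the text (the reverse primer is joined across the line break); the helper functions are hypothetical and the check covers only length, GC content and the reverse complement, not melting temperature or target specificity.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# EGFR primers as listed in the Methods text.
EGFR_FORWARD = "AGCTCTTCGGGGAGCAGCGA"
EGFR_REVERSE = "TGCACGTGGCTTCGTCTCGG"

for name, primer in (("forward", EGFR_FORWARD), ("reverse", EGFR_REVERSE)):
    print(
        f"{name}: {len(primer)} nt, GC {gc_fraction(primer):.0%}, "
        f"reverse complement {reverse_complement(primer)}"
    )
```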

Electron microscopy. Specimens fixed in 2% glutaraldehyde and 2% paraformaldehyde 
were cut into 200-µm sections on a vibratome. Sections were post-fixed in 2% 
osmium, rinsed, dehydrated and embedded in Araldite (Durcupan, Fluka). To study 
subventricular zone architecture, we cut serial 1-µm semi-thin sections and stained 
them with 1% toluidine blue. To identify individual cell types, ultra-thin (0.05-µm) 
sections were cut with a diamond knife, stained with lead citrate and examined under 
a Jeol 100CX electron microscope. 
Structural basis of RNA recognition and activation by 
innate immune receptor RIG-I 


Fuguo Jiang1*, Anand Ramanathan2*, Matthew T. Miller1, Guo-Qing Tang2, Michael Gale Jr3, Smita S. Patel2 & Joseph Marcotrigiano1 


Retinoic-acid-inducible gene-I (RIG-I; also known as DDX58) is a 
cytoplasmic pathogen recognition receptor that recognizes pathogen- 
associated molecular pattern (PAMP) motifs to differentiate between 
viral and cellular RNAs. RIG-I is activated by blunt-ended double- 
stranded (ds)RNA with or without a 5’-triphosphate (ppp), by single- 
stranded RNA marked by a 5’-ppp' and by polyuridine sequences”. 
Upon binding to such PAMP motifs, RIG-I initiates a signalling 
cascade that induces innate immune defences and inflammatory 
cytokines to establish an antiviral state. The RIG-I pathway is highly 
regulated and aberrant signalling leads to apoptosis, altered cell 
differentiation, inflammation, autoimmune diseases and cancer*”. 
The helicase and repressor domains (RD) of RIG-I recognize 
dsRNA and 5’-ppp RNA to activate the two amino-terminal caspase 
recruitment domains (CARDs) for signalling. Here, to understand 
the synergy between the helicase and the RD for RNA binding, and 
the contribution of ATP hydrolysis to RIG-I activation, we deter- 
mined the structure of human RIG-I helicase-RD in complex with 
dsRNA and an ATP analogue. The helicase-RD organizes into a ring 
around dsRNA, capping one end, while contacting both strands 
using previously uncharacterized motifs to recognize dsRNA. 
Small-angle X-ray scattering, limited proteolysis and differential 
scanning fluorimetry indicate that RIG-I is in an extended and 
flexible conformation that compacts upon binding RNA. These 
results provide a detailed view of the role of helicase in dsRNA 
recognition, the synergy between the RD and the helicase for 
RNA binding and the organization of full-length RIG-I bound to 
dsRNA, and provide evidence of a conformational change upon 
RNA binding. The RIG-I helicase-RD structure is consistent with 
dsRNA translocation without unwinding and cooperative binding 
to RNA. The structure yields unprecedented insight into innate 
immunity and has a broader impact on other areas of biology, 
including RNA interference and DNA repair, which utilize homo- 
logous helicase domains within DICER and FANCM. 


To investigate the contributions of the individual domains of RIG-I 
to RNA binding, we used fluorescence anisotropy to determine the 
equilibrium dissociation constants (Kd) of the protein-RNA complexes. 
The tightest RNA affinity was observed with the helicase-RD, 
whereas the full-length RIG-I, helicase domain and RD bind dsRNA 
with a 24-fold, 8,600-fold and 50-fold weaker affinity, respectively, and 
all proteins bind RNA with a 1:1 stoichiometry (Table 1 and Sup- 
plementary Fig. 1). Consistent with the affinities, full-length RIG-I 
and helicase-RD demonstrated robust dsRNA-stimulated ATPase 
activity, whereas the helicase domain showed a weak ATPase activity 
(Table 1). Interestingly, the presence of a 5’-ppp in dsRNA does not 
alter the stoichiometry of RNA binding, although the 5'-ppp dsRNA 
binds more tightly than the 5’-OH dsRNA to full-length RIG-I. 
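The fold differences quoted above can be reproduced from the dissociation constants listed in Table 1 for the fluorescein-labelled 14-bp dsRNA without nucleotide. This is a minimal sketch using the rounded table values, so the ratios it gives (about 24, 8,700 and 52) differ slightly from the rounded figures in the text; the dictionary name `KD_NM` is a label introduced here.

```python
# Dissociation constants (nM) from Table 1, 14-bp dsRNA-F, no nucleotide.
KD_NM = {
    "helicase-RD": 0.05,
    "full-length RIG-I": 1.2,
    "helicase": 436.0,
    "RD": 2.6,
}

reference = KD_NM["helicase-RD"]  # tightest binder

# A larger Kd means weaker binding, so the ratio is the fold loss in affinity.
fold_weaker = {
    protein: kd / reference
    for protein, kd in KD_NM.items()
    if protein != "helicase-RD"
}

for protein, fold in fold_weaker.items():
    print(f"{protein}: ~{fold:,.0f}-fold weaker than helicase-RD")
```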

Crystals of RIG-I helicase-RD in complex with ADP·BeF3 and 14 
base-pair palindromic dsRNA diffracted to 2.9 Å resolution (Supplementary 
Table 1). The dsRNA maintains an A-form helical conformation 
and interacts with all four domains of helicase-RD, which 
are arranged into a ring around the dsRNA (Fig. 1a–c). The RecA-like 
helicase domain (domain 1) progresses to an α-helical domain 
(domain 3) and into the second RecA-like helicase domain (domain 
2) that ends with the RD. The notable features of the RIG-I structure 
are the linkers that connect the domains. Domains 1 and 3 are connected 
by a β-strand that is part of the parallel β-sheet in domain 2. 
The RD is connected to domain 2 via a prominent V-shaped linker, 
consisting of two α-helices that interact extensively with domains 1 
and 2 with a buried surface over 1,500 Å². The V-shaped linker extends 
into a proline-rich (KPKPVPD) loop that makes the final connection 
to the RD. Interestingly, T770 at the vertex of the V-shaped linker 
is one of the phosphorylation sites that regulates RIG-I signalling®. 

Molecular surface analysis shows that one end of the dsRNA is 
capped by the entire helicase-RD molecule whereas the opposite end 
of the RNA is exposed (Fig. 1d-f). At the capped end, the 5’ terminus 
of the dsRNA abuts the RD domain but the 3’ terminus is somewhat 


Table 1 | Summary of ATPase rates and affinity measurements of dsRNA binding 

ATPase    Affinity measurements* 
Protein    RNA    ATPase activity (M/(M × s))    RNA    Nucleotide analogue    Dissociation constant (Kd, nM)    Final anisotropy 

RD    14-bp dsRNA-F    No nucleotide    2.6 ± 1.20    0.109 
Helicase    14-bp dsRNA    0.89 ± 0.04    14-bp dsRNA-F    No nucleotide    436 ± 121    0.138 
    No RNA    Undetectable    ADP·BeF3    1,300 ± 300†    0.138 
Helicase-RD    14-bp dsRNA    21.8 ± 0.70    14-bp dsRNA-F    No nucleotide    0.05 ± 0.02    0.213 
    14-bp palindromic dsRNA    24.4 ± 0.21    5'-ppp 14-bp dsRNA-F    —    ADP·BeF3    0.02 ± 0.07    0.280 
    No RNA    Undetectable    No nucleotide    0.03 ± 0.03    0.190 
Full-length RIG-I    14-bp dsRNA    28.2 ± 1.42    14-bp dsRNA-F    No nucleotide    1.2 ± 0.6†    0.187 
    No RNA    Undetectable    5'-ppp 14-bp dsRNA    No nucleotide    0.44 ± 0.27    0.183 

Mean ATPase rate constant from two independent measurements and range is shown. Affinity measurements and range from two or more independent experiments are shown. Initial anisotropy of the free fluorescein-labelled dsRNA was 0.10 for all experiments. 
* Data was fitted to a 1:1 (protein:RNA) binding model. 
† Value from single measurement and fitting error is shown. 
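
For orientation, a 1:1 binding model for anisotropy titrations is commonly written in the quadratic (tight-binding) form below. This is the general textbook expression, given here as an assumption rather than the authors' exact fitting equation. The symbols are introduced for illustration: r is the observed anisotropy, r_f and r_b are the anisotropies of free and fully bound RNA, P_t and R_t are the total protein and RNA concentrations, and K_d is the dissociation constant.

```latex
r = r_f + (r_b - r_f)\,\frac{[PR]}{R_t},
\qquad
[PR] = \frac{(P_t + R_t + K_d) - \sqrt{(P_t + R_t + K_d)^2 - 4\,P_t R_t}}{2}
```

With r_f fixed at the initial anisotropy of the free RNA (0.10 in the table footnote), r_b corresponds to the final anisotropy column and K_d is the fitted parameter.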


1Center for Advanced Biotechnology and Medicine, Department of Chemistry and Chemical Biology, Rutgers University, 679 Hoes Lane West, Piscataway, New Jersey 08854, USA. 2Department of Biochemistry, UMDNJ-RWJ Medical School, 675 Hoes Lane West, Piscataway, New Jersey 08854, USA. 3Department of Immunology, University of Washington School of Medicine, 1959 NE Pacific Street, Seattle, Washington 98195, USA. 
*These authors contributed equally to this work. 
Figure 1 | Structural overview of RIG-I helicase-RD. a–c, Schematic 
representation of the RIG-I helicase-RD, highlighting the RecA-like domain 1 
(blue), the α-helical domain 3 (green), the RecA-like helicase domain 2 (yellow) 
and the RD (red). The linker connecting domain 1 with domain 3 is coloured 
teal, and the V-shaped linker between domain 2 and the RD is coloured orange. 
The ADP·BeF3 and dsRNA are shown in stick representation with the 5’ and 3’ 


exposed through a highly basic channel (Fig. 1b, e). The 3’-terminus 
RNA strand will be referred to as the 3’ strand, and the opposite strand 
will be referred to as the 5’ strand. This basic channel may allow RIG-I 




Figure 2 | Interactions of the RIG-I helicase-RD with dsRNA and 
ADP·BeF3. a, A schematic representation showing the interactions between 
RIG-I domains and helicase motifs (given in parentheses) with dsRNA located 


strands of the RNA coloured black and beige, respectively. A grey sphere 
denotes the position of the zinc ion in the RD. d-f, The surface of the RIG-I 
helicase-RD coloured for electrostatic potential at ±5 kT e⁻¹: blue (basic), 
white (neutral) and red (acidic). The views in panels a, b and c are identical to 
those in d, e and f, respectively. 


to recognize dsRNA with 3’ nucleotide overhangs or 3’ monopho- 
sphates that are products of RNase L digestion’. Additionally there are 
visible channels along the long axis of the dsRNA, which may allow the 
in the centre. Detailed contacts are shown in the surrounding panels. b, Stick 
representation detailing the interactions of RIG-I helicase motifs with 
ADP·BeF3 and Mg2+ is shown in the lower left panel. 
domains to flex and accommodate bulges or noncanonical base pairs 
(Fig. 1e, f). 

All four domains of the helicase-RD participate in dsRNA binding, 
burying a total of 1,500 Å² of surface area to encircle about eight base 
pairs (Fig. 2a). Most contacts are with the sugar phosphate backbone of 
both strands but there are a few base-specific interactions. The F853 in 
the RD stacks over the terminal C1-G14 base pair, and H830 and S854 
form hydrogen bonds with the ribose 2’-OH of C1 (5’ strand) and G14 
(3' strand), respectively. Phosphorylation of S854 and S855, shown to 
negatively regulate RIG-I signalling®, would be predicted to adversely 
affect RNA binding. The RD-RNA contacts in the helicase-RD struc- 
ture are identical to those identified in the structures of the isolated RD 
bound to dsRNA*!°. Residues H847, K858 and K861 of the RD, 
reported to interact with the 5’-ppp, do not make any new interactions 
in the helicase-RD structure. The only RD contact with the 3’ strand, 
other than at the blunt end, is between S906 and the U7 phosphate 
backbone. 

The core helicase (domains 1 and 2) contains characteristic motifs 
critical for RNA binding and ATP hydrolysis’ (Supplementary Fig. 3). 
The two helicase domains together contact five nucleotides from the 3’ 
end of the 3’ strand (G14-C10) and four nucleotides in the middle of 
the 5’ strand (C4—-U7) (Fig. 2a). Motif Ib (S325 and G326) and motif Ic 
(T347, Q349 and N353) of domain 1 contact the ribose-phosphate 
backbone of the terminal base of the 3’ strand (G14), while motif Ia 
(N298, Q299 and I300) interacts with C13 and U12. Motifs IV, IVa, 
IVb and V in domain 2 interact with the 3’ strand from C13 to C10. 
Residues in motif IV (K635, T636 and R637) interact with the phos- 
phate backbone of G11-C10. Similarly, motif IVa (T662, G663 and 
R664) and motif IVb (Q678) contact the backbone of C13-G11, and 
motif V (V699) interacts with the RNA backbone at C10. 

All RIG-I-like helicases contain a large insertion (domain 3) 
between the core helicase domains"’. The structure of the RIG-I-like 
helicase Hef showed a similar α-helical domain 3, except it is rotated in
RIG-I owing to dsRNA interactions (Supplementary Fig. 4). An
α-helix within domain 3 (residues 506-522) of RIG-I runs almost
perpendicular to the minor groove of the dsRNA and interacts with 
the 3’ strand without contacting the 5’ strand (Fig. 2a). The interac- 
tions of domain 3 with the dsRNA via residues E510, V514 and K518 
extend the helicase contacts to U7 and A8 of the 3’ strand. 

Although most interactions of the RIG-I are with the 3’ strand, all 
three helicase domains are in close proximity to the 5’ strand and two 
new motifs contact the 5’ strand. Motif IIa (KQHPY) immediately
follows motif II (³⁷²DECH) and interacts with C4-C6 of the 5’ strand.
Similarly, N720 in motif Vc in domain 2 contacts the 5'-strand back- 
bone at C6 and U7. These new motifs may represent a general feature 
of helicases binding to dsRNA or dsDNA, as a region similar to motif 
IIa was identified in the Swi/Snf2 family helicases and a Rad54 homologue
dsDNA complex structure”.

RIG-I helicase-RD contains one ADP·BeF₃ molecule bound at the
interface of domains 1 and 2 via the conserved helicase motifs Q, I, II, 
III, Va and VI, which are generally involved in ATP binding/hydrolysis
(Fig. 2b)’. The Q motif makes adenine-specific contacts, whereas the 
other motifs are involved in binding the triphosphate moiety and the 
Mg²⁺. Motif I (²⁶⁷GCGKT) contacts the phosphates and the BeF₃,
which mimics the y-phosphate of ATP as observed in other SF2 
helicases'*. The D372 and E373 of motif II (³⁷²DECH) coordinate
the Mg²⁺ to stabilize the ATP analogue. The helicase motifs Va and
VI, which contact the ribose and phosphates of the ATP in other 
helicases, are close to ADP·BeF₃.

RIG-I has been shown to have translocation activity on dsRNA“ 
and has been reported to unwind short dsRNA”, although we (data 
not shown) and others’* have failed to detect unwinding activity. 
Superposition of RIG-I with hepatitis C virus (HCV) NS3 helicase 
(NS3h) bound to single-stranded (ss) DNA" (Fig. 3a) makes predic- 
tions about translocation and lack of helicase activity of RIG-I. First, 
ssDNA in NS3h overlays with the 3’ strand of the dsRNA bound to 


Figure 3 | Comparison of RIG-I helicase-RD with HCV NS3h and RD 
bound to 5’-OH and 5’-ppp dsRNA. a, Ribbon diagram showing the 
superposition of the RIG-I helicase-RD-dsRNA-ADP·BeF₃ structure and
NS3h bound to ssDNA (PDB accession 3KQH) (grey). The helicase core 
domains 1 and 2 from RIG-I helicase-RD superimpose well, whereas domain 3 
of NS3h is positioned over the RD. b, Superposition of the RIG-I helicase-RD 
with NS3h demonstrates that the ssDNA bound to NS3h overlays with the 3’ 
strand (beige) of the dsRNA bound to the helicase-RD. c, The location of the 
Phe-loop of NS3h relative to the dsRNA of the RIG-I helicase-RD-dsRNA- 
ADP·BeF₃ structure. d, e, Superposition of the 5’-OH (blue; PDB accession
3OG8) and 5’-ppp dsRNA (magenta; PDB accession 3LRR) based on the
location of the RD. For clarity the 5’ strands (d) and 3’ strands (e) are shown 
separately. 


RIG-I, suggesting that the RIG-I helicase makes principal motor con- 
tacts with the 3’ strand (Fig. 3b) when translocating along dsRNA. 
Second, a conserved Phe-loop in NS3h bisects the dsRNA, consistent 
with its implicated role in unwinding” (Fig. 3b, c). The absence of 
such a motif in RIG-I could explain the lack of RNA-unwinding 
activity. Interestingly, the RIG-I RD positions over domain 3 of 
NS3h, indicating that this domain of NS3h may interact with the 5’ 
strand of dsRNA (Fig. 3a). 

The RD superimposes well with a root mean squared deviation of 
~0.6 Å with the structures of isolated RD bound to dsRNA with and
without 5'-ppp*”°. However, the trajectory of the RNA helix in the 
helicase-RD is different. Superposition of the three RD structures 
demonstrates that the dsRNA bound to the isolated RD clashes with 
domain 3 (Fig. 3d, e). Thus, the dsRNA in helicase-RD is rotated and 
the previously reported contacts (R811, K849, K851 and H871) 
between RD and RNA are no longer observed in the helicase-RD 
structure. 

It has been proposed that RIG-I exists in an auto-inhibited state’® 
with undetectable ATPase activity (Table 1) in the absence of RNA, 
and activates upon binding viral RNAs. Small-angle X-ray scattering 
(SAXS), limited proteolysis and differential scanning fluorimetry 
(DSF) were performed to gain insights into how RNA binding activates 
Figure 4 | Limited trypsin digestion, DSF and SAXS analyses of helicase-RD
and full-length RIG-I in the presence and absence of dsRNA. a, SDS-PAGE 
analysis of a time course (minutes) of limited trypsin digestion of helicase-RD or 
full-length RIG-I in the absence or presence of 14 base-pair palindromic dsRNA. 
b, DSF of RIG-I helicase-RD or full-length (FL) RIG-I in the presence of 14 base-pair
palindromic dsRNA and/or ADP·BeF₃ with respect to protein alone. The
bar graph shows the mean melting temperature difference (ΔTm) and the error


RIG-I. Full-length RIG-I and helicase-RD undergo conformational 
changes and exhibit greater stability upon RNA binding (Fig. 4). 
The radius of gyration (Rg) of helicase-RD and full-length RIG-I 
decreases upon dsRNA binding by 10 and 2.3 Å, respectively (Supplementary
Table 2), consistent with the results from size-exclusion
chromatography (Supplementary Fig. 1b). The SAXS Kratky plots of 
helicase-RD and full-length RIG-I with RNA are symmetrical para- 
bolic curves (Supplementary Fig. 5), consistent with folded and globu- 
lar complexes. The parabolic shape of the Kratky plot is lost and the 
peak amplitude decreases in the absence of RNA, indicative of greater 
flexibility’’. Limited trypsin digestion and DSF show a greater stabilization
with dsRNA and a small increase in stability upon addition of
the ATP analogue (Fig. 4a, b). These data indicate that RIG-I is com- 
posed of globular domains connected with flexible linkers that become 
ordered upon RNA binding. Such conformational stabilization upon 
ATP and RNA binding has been documented in other helicases”. 
A model of full-length RIG-I bound to dsRNA was established from 
ab initio SAXS envelope followed by rigid body refinement of the 
helicase-RD-dsRNA complex and homologous CARD structures 
(Supplementary Table 2 and Fig. 4c—f). The helicase-RD-dsRNA com- 
plex and two copies of the CARD are accommodated in the full-length 
RIG-I SAXS envelope. The two CARDs project from domain 1. Such 
an orientation of the CARDs would allow for interaction with down- 
stream signalling factors****. Interestingly, the second CARD is posi- 
tioned adjacent to the V-shaped linker and close to the T770 
phosphorylation site. The close proximity of the linker to the second 
CARD suggests a possible mechanism for the two α-helices to serve as
bars represent the standard deviation from three independent measurements.
c, d, Ab initio envelope of helicase-RD and dsRNA overlaid with the crystal
structure of helicase-RD-dsRNA (dsRNA truncated to 10 base pairs). The view
in d is rotated 90° about a horizontal axis from panel c. e, f, Ab initio envelope of
full-length RIG-I and dsRNA overlaid with the crystal structure of helicase-RD- 
dsRNA with two copies of CARDs added (PDB accession 2VGQ). The view in 
f is rotated 90° about a horizontal axis from panel e. 


a hinge for RD movement upon RNA binding, leading to RIG-I activa- 
tion. The model indicates that the dsRNA extends along a perpendic- 
ular axis relative to the CARDs and hence can accommodate several 
RIG-I molecules”’. 


METHODS SUMMARY 


Recombinant full-length RIG-I (1-925), helicase-RD (232-925), helicase (232-794),
RD (795-925) and selenomethionine-derivatized helicase-RD were expressed
in Escherichia coli and purified to homogeneity using immobilized metal ion
affinity, hydroxyapatite and heparin affinity chromatography. Fluorescence
anisotropy titrations were performed at 25 °C (ref. 30) (λexcitation 494 nm and λemission
516 nm) using a fluorescein-labelled 14 base-pair dsRNA prepared by annealing
5'-GGAGAGAACCGCCU and 3’-CCUCUCUUGGCGGA-F RNA, where F is
fluorescein. Crystals of the native and selenomethionine helicase-RD with
palindromic dsRNA (5'-CGACGCUAGCGUCG) and ADP·BeF₃·Mg²⁺ were
obtained in 25% (w/v) PEG 3350, 0.25 M NaSCN, 100 mM MOPS (pH 7.8), 3%
(v/v) 2,2,2-trifluoroethanol at 20 °C by hanging drop. The crystals belong to space
group P6₅22 with cell parameters a = b = 174.9 Å and c = 110.9 Å. The structure
was determined by single-wavelength anomalous dispersion (SAD) to 3.2 Å
resolution and refined against a 2.9 Å resolution native data set. The final model has an
Rwork and Rfree of 0.199 and 0.287, respectively. SAXS measurements were
performed on full-length RIG-I and helicase-RD in the absence and presence of
dsRNA (5'-GCGCGCGCGC). Buffer subtraction and radius of gyration Rg were
calculated from Guinier plots. The maximum particle size Dmax was determined by
scanning a range of values and comparing experimental scattered intensity I(s)
values to distance distribution function P(r) transforms. Ten ab initio models were
averaged and normalized spatial discrepancy (NSD) values were calculated. The
helicase-RD-dsRNA and a homologous CARD structure were positioned into the
ab initio model and χ² values were determined.
Full Methods and any associated references are available in the online version of
the paper at www.nature.com/nature.

METHODS

Protein expression and purification. All the protein constructs were expressed
with an N-terminal 6×His-SUMO fusion. Human full-length RIG-I (1-925),
helicase-RD (232-925) and helicase (232-794) were expressed in Escherichia coli
strain Rosetta (DE3) (Novagen) as soluble proteins. The soluble fraction of
helicase-RD was purified from the cell lysate using a Ni²⁺-nitrilotriacetate
(Qiagen) column. The recovered protein was then digested with Ulp1 protease
to remove the 6×His-SUMO tag and further purified by hydroxyapatite column
(CHT-II, Bio-Rad) and heparin sepharose column (GE Healthcare). Finally,
purified helicase-RD was dialysed against 50 mM HEPES (pH 7.5), 50 mM NaCl,
5 mM DTT, 10% glycerol overnight at 4 °C, snap frozen in liquid nitrogen, and
stored at −80 °C. The RIG-I RD (795-925) was expressed in E. coli BL21 Star
(DE3) cells and the soluble fraction was purified to homogeneity using a
Ni²⁺-nitrilotriacetate column, cation exchange (HiTrap SP, GE Healthcare) and gel
filtration chromatography. Selenomethionine (SeMet)-labelled helicase-RD was
produced in Rosetta (DE3) cells grown in M9 minimal medium supplemented
with 50 mg ml⁻¹ L-SeMet (Sigma) and specific amino acids to inhibit endogenous
methionine synthesis. The SeMet protein was purified using the same procedure as
the unmodified protein.

Fluorescence anisotropy titrations. Fluorescence anisotropy measurements”! 
were carried out on a Fluoro-Max-4 spectrofluorimeter (Horiba Jobin Yvon). 
Fluorescein-labelled 14 base-pair dsRNA (Dharmacon) (10 nM) was titrated with
full-length RIG-I, helicase domain, RD or helicase-RD in 50 mM Tris-acetate (pH
7.5), 100 mM Na-acetate, 10 mM Mg-acetate, 5% glycerol, 5 mM DTT, 0.05%
Tween 20 buffer. 5'-ppp dsRNA (GGAGAGAACCGCCU) was transcribed using
T7 RNA polymerase, PAGE purified, and annealed to a complementary,
fluorescein-labelled RNA. Fluorescein anisotropy was measured at 25 °C with
excitation at 494 nm and emission at 516 nm. The helicase-RD titrations were also
performed at lower dsRNA concentrations (1 and 2 nM) to obtain an accurate
Kd. The observed fluorescence anisotropy (robs) as a function of protein
concentration (Pt) was fit to equations (1) and (2) to obtain the equilibrium dissociation
constant, Kd.


$$ r_{\mathrm{obs}} = r_b f_b + r_f (1 - f_b) \qquad (1) $$

where $r_f$ and $r_b$ are the anisotropy values of free RNA and of the complex, $f_b$ is
the fraction of RNA bound in the protein-RNA complex and $f_b = [PR]/[R_t]$
($[PR]$ is the concentration of the protein-RNA complex and $[R_t]$ is the total RNA
concentration).
Initial anisotropy of the free fluorescein-labelled dsRNA was 0.10 for all experi-
ments. The measurements and errors are from two independent experiments in 
Table 1. 
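
Equation (2) is not reproduced in this text. The following is a minimal sketch of how such a titration could be fit, assuming the standard 1:1 quadratic binding expression for [PR]; that expression, the grid search, the bound-state anisotropy of 0.25 and the synthetic data are all assumptions for illustration, not the authors' procedure or results.

```python
import math

def fraction_bound(p_t, r_t, k_d):
    """Fraction of RNA bound, f_b = [PR]/[R_t], for 1:1 binding.

    Assumes the standard quadratic solution for [PR]; this stands in for
    equation (2), which is not shown in the text.
    """
    s = p_t + r_t + k_d
    pr = (s - math.sqrt(s * s - 4.0 * p_t * r_t)) / 2.0
    return pr / r_t

def r_obs(p_t, r_t, k_d, r_f, r_b):
    """Equation (1): observed anisotropy as a weighted sum of bound and free RNA."""
    f_b = fraction_bound(p_t, r_t, k_d)
    return r_b * f_b + r_f * (1.0 - f_b)

def fit_kd(p_values, r_values, r_t, r_f, r_b, k_grid):
    """Return the K_d on a grid that minimises the sum of squared residuals."""
    def sse(k_d):
        return sum((r_obs(p, r_t, k_d, r_f, r_b) - r) ** 2
                   for p, r in zip(p_values, r_values))
    return min(k_grid, key=sse)

# Synthetic titration in nM. r_f = 0.10 matches the free-RNA anisotropy quoted
# in the text; r_b = 0.25 and the K_d of 2 nM are hypothetical.
R_T, R_F, R_B, TRUE_KD = 10.0, 0.10, 0.25, 2.0
proteins = [0.5, 1, 2, 5, 10, 20, 50, 100]
signal = [r_obs(p, R_T, TRUE_KD, R_F, R_B) for p in proteins]
grid = [0.1 * i for i in range(1, 201)]  # candidate K_d values, 0.1 to 20 nM
best_kd = fit_kd(proteins, signal, R_T, R_F, R_B, grid)
```

A grid search is used here only to keep the sketch dependency-free; a nonlinear least-squares routine would normally replace it and would also float r_b.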

ATPase activity. A time course (0-30 min) of the ATPase reaction was performed 
using RIG-I helicase-RD (5 nM), ATP (1 mM) spiked with [γ-³²P]ATP with or
without 14 base-pair dsRNA (80 nM) in buffer containing 50 mM MOPS (pH 7.4),
5 mM MgCl₂, 5 mM DTT, 0.01% Tween 20 at 37 °C. The ATPase activity of the
RIG-I helicase domain was measured using a higher amount of protein (100 nM)
and RNA (1 μM). The quenched reactions were analysed by PEI-Cellulose-F TLC
(Merck) developed in 0.4 M potassium phosphate buffer (pH 3.4). The TLC plates 
were exposed to a phosphorimager plate, analysed on a Typhoon phosphor-ima- 
ger, and quantified using ImageQuant software. The ATPase rate was determined 
from the plots of [Pi] produced versus time and the rate constant values were 
calculated by dividing the ATPase rate by the respective enzyme concentration. 
The mean rate constant from two independent measurements and range is shown 
in Table 1. 
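
The rate-constant calculation described above can be sketched as a linear fit of [Pi] against time followed by division by the enzyme concentration. The time points and phosphate concentrations below are invented for illustration, and the least-squares slope is an assumed way of reading the rate from the plot.

```python
def slope(xs, ys):
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

def rate_constant(times_s, pi_um, enzyme_um):
    """ATPase rate (uM Pi per s) divided by enzyme concentration (uM), in s^-1."""
    return slope(times_s, pi_um) / enzyme_um

# Hypothetical time course over 0-30 min with 5 nM (0.005 uM) enzyme.
times = [0, 300, 600, 900, 1200, 1800]          # seconds
phosphate = [0.0, 7.5, 15.0, 22.5, 30.0, 45.0]  # uM Pi produced
k_atpase = rate_constant(times, phosphate, 0.005)
```

Only the initial, linear part of a real time course should enter the fit, since the rate falls as ATP is consumed.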

Analytical size-exclusion chromatography. Analytical size-exclusion
chromatography was carried out on an ÄKTA FPLC system (GE Healthcare). Proteins were
loaded on to a Superdex 200 10/300 GL column (GE Healthcare) equilibrated with a
buffer containing 50 mM HEPES (pH 7.5), 50 mM NaCl, 5 mM MgCl₂, 5 mM
DTT. The eluate was monitored by ultraviolet absorbance at 280 nm. For the
complex, helicase-RD was incubated with 14 base-pair dsRNA at the molar ratio
of 1:1.2 on ice for 15 min before the sample was applied onto the column.

Preparation of helicase-RD-dsRNA. The palindromic RNA oligonucleotide
(5'-CGACGCUAGCGUCG-3’) (Dharmacon) was deprotected and desalted 
into 20 mM potassium phosphate (pH 7.4) before use. The dsRNA was prepared 
by incubating the RNA at 95°C for 1 min followed by gradual cooling to 4 °C. 
The resulting dsRNAs were mixed with purified helicase-RD in a RNA:protein 
molar ratio of 1.2:1, incubated at room temperature (20 °C) for 15 min, and then 
purified by size-exclusion chromatography (Superdex200, GE Healthcare) with 


an elution buffer of 50 mM HEPES (pH 7.5), 50 mM NaCl, 5 mM DTT,
5 mM MgCl₂.

Crystallization and X-ray diffraction data collection. Helicase-RD-dsRNA-
ADP·BeF₃ ternary complex was reconstituted by incubating 0.17 mM helicase-RD
with 0.17 mM dsRNA, 2 mM ADP, 2 mM BeCl₂ and 10 mM NaF on ice for
30 min before crystallization. Crystals of native and SeMet-substituted complex
were grown by the hanging-drop vapour diffusion method at 20 °C. Aliquots
(2.5 μl) of 15 mg ml⁻¹ of helicase-RD-dsRNA-ADP·BeF₃ complex in 50 mM
HEPES (pH 7.5), 50 mM NaCl, 5 mM DTT, 5 mM MgCl₂ were mixed with
2.5 μl of reservoir solution containing 25% (w/v) PEG 3350, 0.25 M NaSCN,
100 mM MOPS (pH 7.8), 3% (v/v) 2,2,2-trifluoroethanol. Crystals appeared after
2-3 days, and they grew to a maximum size of 0.15 × 0.15 × 0.5 mm over the
course of 8 days. 5-iodo-uridine derivative helicase-RD-dsRNA-ADP·BeF₃
crystals were grown under identical conditions. For cryogenic data collection, crystals
were transferred into crystallization solutions containing 5% (v/v) (2R,3R)-(-)-2,3-
butanediol as cryoprotectant and then flash-cooled at 100 K. The SeMet SAD data set
was collected at the X29A beamline of the NSLS. The native data set was recorded at
the LRL-CAT 31-ID beamline of the APS. Data from iodouridine derivative
crystals were collected at the CHESS F1 beamline. All diffraction data were integrated
using iMosflm and scaled in SCALA”.

Structure determination and refinement. Phases were determined using the 
SAD method. SHELXD/HKL2MAP® detected a total of 12 out of 14 possible 
selenium sites in the asymmetric unit. Initial phases were calculated with 
Phaser and followed by density modification by DM**. Phase extension to 
3.2 Å by SOLOMON” produced an electron-density map into which most of
the protein and RNA residues could be built unambiguously. The model was built 
using Coot™ and refined in PHENIX”. The final model, comprising ADP·BeF₃, 12
base-pair dsRNA, RIG-I residues 240-495, 504-522, 527-687 and 690-923 has
Rwork and Rfree values of 0.199 and 0.287, respectively. Model validation
demonstrated no outliers and 90% of the residues located in the most favourable region of
the Ramachandran plot”. Statistics of the data processing and structure
refinement are summarized in Supplementary Table 1.

SAXS and structural modelling. SAXS data were collected at the CHESS 
beamline G1 using a Finger Lakes CCD X-ray detector with a sample-to-
detector distance of 1,450 mm, covering the range of scattering vectors
0.01 < s < 0.233 Å⁻¹, where s = 4π sin θ/λ (2θ is the scattering angle and
λ = 1.296 Å). All samples were buffer exchanged into 50 mM HEPES (pH 7.5),
50mM NaCl, 5 mM DTT, 5% glycerol by size-exclusion chromatography (Hiload 
16/26 Superdex200, GE Healthcare) to minimize the discrepancies in background 
subtraction. SAXS data were reduced using Data Squeeze. Various programs in 
ATSAS software package were used to process and evaluate the scattering data. 
Radius of gyration (Rg) was analysed using the Guinier approximation with low
angle data (s < 1.3/Rg) using PRIMUS”. The residuals from the Guinier plots did
not show signs of protein aggregation. The probability distribution of distances 
between scattering atoms within the macromolecule, P(r), and the maximum atom 
pair distance, Dmax, were determined from the scattering data using GNOM”. The
program DAMMIF* was used to calculate low-resolution ab initio reconstruc- 
tions from experimental SAXS profiles. Ten to twenty independent models were 
aligned, filtered and averaged based on the occupancy using SUPCOMB and 
DAMAVER?” to reconstruct the final ab initio envelope, as judged by averaged 
normalized spatial discrepancies (NSD) less than 1.0. The structure modelling was 
refined against the solution scattering data by rigid body docking using SASREF 
program”. The orientation and position of individual domains in the structure 
models were further manually adjusted to minimize the discrepancies (χ²)
between the calculated scattering intensities and experimental scattering intens- 
ities computed using program CRYSOL"*’. Kratky plots were calculated using 
Origin (OriginLab)**. 
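
The Guinier step above can be illustrated with a short sketch. It assumes the usual Guinier relation ln I(s) = ln I(0) − (Rg² s²)/3 and fits a straight line to ln I against s²; the synthetic intensities, the 30 Å radius and the function names are hypothetical and are not taken from the ATSAS programs named in the text.

```python
import math

def scattering_vector(two_theta_rad, wavelength=1.296):
    """s = 4*pi*sin(theta)/lambda, with 2*theta the scattering angle (lambda in Angstrom)."""
    return 4.0 * math.pi * math.sin(two_theta_rad / 2.0) / wavelength

def guinier_fit(s_values, intensities):
    """Fit ln I against s^2; the slope equals -Rg^2/3 and the intercept ln I(0)."""
    xs = [s * s for s in s_values]
    ys = [math.log(i) for i in intensities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return math.sqrt(-3.0 * slope), math.exp(intercept)

def kratky(s_values, intensities):
    """Kratky transform: I(s) * s^2, plotted against s to judge compactness."""
    return [i * s * s for s, i in zip(s_values, intensities)]

# Synthetic low-angle data for a hypothetical particle with Rg = 30 Angstrom,
# restricted to s * Rg < 1.3 as in the text.
TRUE_RG, I_ZERO = 30.0, 100.0
s_grid = [0.010 + 0.005 * k for k in range(7)]  # 0.010 to 0.040 per Angstrom
i_obs = [I_ZERO * math.exp(-(TRUE_RG * s) ** 2 / 3.0) for s in s_grid]
rg_fit, i0_fit = guinier_fit(s_grid, i_obs)
```

With real data the fit range has to be chosen iteratively, because the s < 1.3/Rg limit depends on the Rg being estimated.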

DSF. Thermal shift assay was conducted with 10 μM of RIG-I helicase-RD or
full-length RIG-I with or without 12 μM of 14 base-pair palindromic dsRNA
and/or 2 mM ADP·BeF₃ in 50 mM HEPES (pH 7.5), 50 mM NaCl, 5 mM
MgCl₂, 5 mM DTT and a 5× dilution of SYPRO Orange dye (Invitrogen) as
described”. The fluorescence signal as a function of temperature was recorded
using a Real Time PCR machine (Applied Biosystems). The temperature gradient
was performed in the range of 25-80 °C with a ramp of 0.2 °C over the course of
60 min. Control assays were carried out with buffer in the presence or absence of
RNA. Data were analysed with the Excel-based worksheet DSF analysis, and the
Boltzmann model was used to fit the fluorescence data to obtain the midpoint
temperature for the thermal protein unfolding transition (Tm) using the
curve-fitting software Prism.
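
As a rough illustration of the Boltzmann fit, the sketch below assumes the common sigmoid F(T) = Fmin + (Fmax − Fmin)/(1 + exp((Tm − T)/a)) and finds Tm by a simple grid search. The grid search replaces the nonlinear regression performed in Prism, and the melting curves and Tm values are synthetic.

```python
import math

def boltzmann(t, f_min, f_max, t_m, slope):
    """Boltzmann sigmoid: fluorescence rises from f_min to f_max around t_m."""
    return f_min + (f_max - f_min) / (1.0 + math.exp((t_m - t) / slope))

def fit_tm(temps, signal, tm_grid, slope_grid):
    """Grid search for the (t_m, slope) pair with the smallest squared error.

    f_min and f_max are fixed to the observed extremes, a simplification;
    a full fit would float all four parameters.
    """
    f_min, f_max = min(signal), max(signal)
    best = None
    for t_m in tm_grid:
        for slope in slope_grid:
            sse = sum((boltzmann(t, f_min, f_max, t_m, slope) - f) ** 2
                      for t, f in zip(temps, signal))
            if best is None or sse < best[0]:
                best = (sse, t_m, slope)
    return best[1], best[2]

# Synthetic melting curves over 25-80 degrees C; both Tm values are hypothetical.
temps = [25.0 + 0.5 * i for i in range(111)]
tm_grid = [30.0 + 0.5 * i for i in range(81)]
slope_grid = [1.0, 1.5, 2.0, 2.5, 3.0]
protein_alone = [boltzmann(t, 0.0, 1.0, 45.0, 2.0) for t in temps]
with_rna = [boltzmann(t, 0.0, 1.0, 51.0, 2.0) for t in temps]
tm_alone, _ = fit_tm(temps, protein_alone, tm_grid, slope_grid)
tm_rna, _ = fit_tm(temps, with_rna, tm_grid, slope_grid)
delta_tm = tm_rna - tm_alone  # melting temperature shift relative to protein alone
```

Real DSF traces usually fall again after the transition as the dye-protein aggregates form, so the fit window should stop near the fluorescence maximum.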

Limited trypsin proteolysis. Limited proteolysis with trypsin (Roche) was
performed using 120 μg of purified full-length RIG-I or helicase-RD in the absence or
presence of palindromic dsRNA and incubated with trypsin at a protein:protease
mass ratio of 300:1. The reaction mixtures were maintained at room temperature
and aliquots were removed at 15, 30, 60 and 120 min. The reaction was stopped by
the addition of SDS-PAGE loading buffer and analysed by SDS-PAGE.
EXTRASOLAR PLANETS


Homing in on another Earth 


The identification of the closest analogue of Earth so far, orbiting another star, suggests that small planets are common, and 
that the discovery of a candidate habitable planet in an alien star system could be just around the corner. 


JACOB BEAN 


In the hunt for planets around stars other than the Sun, astronomers’ primary
objective is to find a planet that is teeming with
life. A milestone on the path to this goal is the 
discovery of Earth-sized planets orbiting their 
parent stars in the ‘habitable zone’ — the range 
of distances from the star at which the tem- 
perature would be just right for liquid water 
to be present on a planet’s surface. But which 
stars harbour such planets, how common are 
they, and what are their basic characteristics? 
Although astronomers can’t yet answer these 
questions, a paper to be published in Astron- 
omy & Astrophysics by Pepe et al.’ presents 
the discovery of several planets that marks a 
significant step towards changing this impasse. 

Pepe and colleagues detected five small plan- 
ets orbiting parent stars that are slightly smaller 
and cooler than the Sun. One of the planets is 
only 3.6 times the mass of Earth and is in an 
orbit that teases the inner edge of its host star’s 
habitable zone. This is the closest that astrono- 
mers have yet come to finding another Earth. 
Furthermore, the relative ease with which these 
and other previously reported small planets 
have been found by the same group implies that 
the frequency of such planets around Sun-like 
stars is on the order of tens of per cent. 

The authors made their new discoveries’ 
using the radial-velocity technique, which is an 
indirect method based on measuring the peri- 
odic change in speed of a star caused by the grav- 
itational tug of orbiting bodies. Until this year, 
most planet discoveries were made with this 
method. But there has been a widely held opin- 
ion that the radial-velocity technique will not 
be able to find candidate habitable Earth-mass 
planets despite its success at finding Jupiter- 
mass planets. The reasoning behind this is that 
the variability of the visible surfaces of stars, as 
a result of magnetic activity, pulsations and the 
turbulence of the plasma in their atmospheres, 
creates noise in radial-velocity measurements, 
and this noise is larger than the signal induced 
by an Earth-sized planet in the habitable zone. 
Also, the technical challenge of building instruments
sensitive enough to detect the subtle indications
of such planets, even around perfectly
‘quiet’ stars, was considered too great.
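
To give a sense of scale for the signal discussed above, the following sketch evaluates the standard expression for the radial-velocity semi-amplitude of a star hosting a planet on a circular orbit, K = (2πG/P)^(1/3) · m sin i / (M + m)^(2/3). The formula and the rounded physical constants are textbook values supplied here as assumptions; they do not come from the article.

```python
import math

G = 6.674e-11             # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30          # solar mass, kg
M_EARTH = 5.972e24        # Earth mass, kg
YEAR_S = 365.25 * 86400   # one year in seconds

def rv_semi_amplitude(period_s, planet_mass, star_mass, sin_i=1.0):
    """Stellar reflex velocity (m/s) induced by a planet on a circular orbit."""
    return ((2.0 * math.pi * G / period_s) ** (1.0 / 3.0)
            * planet_mass * sin_i
            / (star_mass + planet_mass) ** (2.0 / 3.0))

# An Earth analogue: one Earth mass, one-year period, Sun-like host, edge-on orbit.
k_earth = rv_semi_amplitude(YEAR_S, M_EARTH, M_SUN)
```

Under these assumptions the Earth analogue produces a signal of roughly 0.09 m/s, about a tenth of a one-metre-per-second signal.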

Pepe and colleagues’ detections’ were
enabled by a combination of exquisite instrumentation,
painstaking analysis and an
ingenious observational technique. Using 
an ultra-stable instrument on a dedicated 
telescope, and with the benefit of ten years’ 
experience in refining their calibration strat- 
egy, the authors set out to intensively monitor 
ten of the quietest known nearby stars. Their 
monitoring campaign involved making mul- 
tiple measurements per night to average over 
the stellar noise cycles that had plagued pre- 
vious studies. Using this approach, they have 
broken through the previously inviolate one- 
metre-per-second radial-velocity barrier and 
detected planets that have signals less than 
tenfold larger than that of an Earth analogue, 

for which the sig- 
selves. They did this
by demonstrating 
that the orbital periods determined for the 
planets were distinct from the host stars’ 
rotation periods, and also that there were no 
correlations between the detected signals and 
the diagnostics of stellar activity. 

Despite being a clear breakthrough, there 
are some limitations to Pepe and colleagues’ 
study. One limitation is that the newly detected 
planets are poorly characterized. We do not 
know the compositions of the planets because 
the radial-velocity method yields no informa- 
tion about the densities of the planets it detects. 
Also, we have little knowledge of how elliptical 
the planets’ orbits are (their orbital eccentric- 
ity), because the detected signals are relatively 
small. Both composition and orbital eccen- 
tricity are crucial parameters for assessing a 
planet’s habitability. 

Furthermore, although Pepe and colleagues’ 
results’, and those described in another paper 
from the same group’, hint at a large popula- 
tion of small planets orbiting Sun-like stars, 
the statistics are weak because these planets are 
difficult to detect and therefore the samples are 




incomplete by an unknown amount. Identi- 
fying a large sample of similar planets, and 
studying them by different methods, would 
advance the field. However, Pepe et al. show 
that the full potential of the radial-velocity 
technique has not yet been realized. This work 
warrants the building of a generation of radial-velocity
instruments that are even more stable
than those now in use, so that instrumental 
noise is no longer a barrier to the discovery of 
Earth-like planets with this method. 

NASA’s ongoing Kepler mission, which finds
planets using the transit technique, announced 
a haul of more than 1,000 new planet candi- 
dates earlier this year, and is on track to 
identify habitable-zone planets’. Because the 
transit technique detects planets by measur- 
ing periodic decreases in a star’s brightness 
when a planet passes in front of it, the issue 
of stellar variability is also a limiting factor for 
this approach, and early results indicate that 
the stars Kepler is looking at are significantly 
more variable than expected*. This means that 
an extension beyond the nominal 3.5-year mis- 
sion duration might be necessary for Kepler 
to securely detect Earth-sized planets in the 
habitable zones of Sun-like stars. 

In the long run, astronomers aim to study 
the atmospheres of the small worlds revealed 
by radial-velocity and transit surveys to obtain 
further insight into the planets’ habitability and 
even, perhaps, their state of inhabitance. The 
investigations that will be enabled by NASA’s
planned James Webb Space Telescope, which is 
currently at risk of cancellation, are the corner- 
stone of astronomers’ next plans in this area. 
Nevertheless, it is clear that the search for other 
Earths is gathering pace. The exciting results 
from Kepler, and the remarkable advances in the 
radial-velocity technique demonstrated by Pepe 
et al., show that the race is well and truly on.
BRIEF COMMUNICATIONS ARISING


Can algal uptake stop NO₃⁻ pollution?


ARISING FROM Cardinale, B. J. Nature 472, 86-89 (2011) 


The influence of biodiversity on ecosystem function has been of interest 
to community ecologists for decades. Recently, Cardinale’ reported that 
biodiversity affects nitrate (NO₃⁻) uptake in algal communities and
that, as a result, biodiversity may help mitigate nutrient pollution. 
Although Cardinale’s conclusions about niche partitioning are interest- 
ing (figure 2 in ref. 1), his extension of these findings to problems of 
nutrient pollution is premature. Algal uptake is only a short-term nitrogen
sink; control of NO₃⁻ pollution requires long-term solutions.

Nitrate removal in streams typically occurs via two pathways. 
Denitrification is the microbial conversion of NO₃⁻ to N₂ gas
(Fig. 1). This is the most important means of permanent NO₃⁻
removal from streams. However, it is not considered in Cardinale’s 
study. 

The second NO₃⁻ uptake pathway is assimilation by primary producers
(in this case, algae). Algae acquire nutrients to fulfil physiological
demands and support growth, but these same nutrients are
later released after cell death or disruption (such as the brushing 
treatment applied to some flumes to create heterogeneity). Algal 
uptake leads to temporary storage of nitrogen in algal tissues (turn- 
over time ranges from ~17.5-47 days**), and can have significant 
short-term effects on nitrogen export to downstream ecosystems. But, 
because benthic algae do not accumulate over long (annual or inter- 
annual) periods, this uptake mechanism represents only a transient 
storage pool and thus has no enduring effect on downstream water 


quality (Fig. 1). 
Figure 1 | Nitrogen cycling in streams. Red arrows indicate processes not
measured by Cardinale’. Storage in groundwater and sediments, N₂ fixation
and fluxes to other ecosystems (for example, floodplains) may also occur. 


Cardinale reply 


Algae could potentially mediate permanent nitrogen removal in 
streams by facilitating increased rates of denitrification®®. Indeed, 
Cardinale’s results raise the intriguing possibility that algal diversity 
could affect denitrification rates via changes in the availability of 
nitrogen, carbon or oxygen to denitrifiers. However, until this poten- 
tial effect is demonstrated, conclusions about biodiversity effects on 
the control of aquatic nitrogen pollution are premature. 

We agree with Cardinale that nitrogen enrichment threatens the 
integrity of aquatic ecosystems and that nitrogen removal is a critical 
ecosystem service provided by streams’. And, although Cardinale has 
elucidated a potential mechanism for biodiversity to influence stream 
nutrient cycling, for this work to be extended to pollution control in 
natural ecosystems it must be tested in environments incorporating 
critical biogeochemical pathways—namely, denitrification and nitro- 
gen fixation—and it must consider long-term nitrogen dynamics. 
REPLYING TO H. M. Baulch, E. H. Stanley & E. S. Bernhardt Nature 477, doi:10.1038/nature10418 (2011)


Over the last 20 years, 59 experiments have quantified how the richness
of plants and algae influences concentrations of inorganic nitrogen
in soil or water’. Of these, 86% have shown that the concentration
of nitrogen decreases as biodiversity increases—by an average of 48%. 
The primary contribution of my study’ was to identify a biological 
mechanism that is likely to explain these biodiversity effects. Using 
stream mesocosms, I showed that the impacts of algal diversity on 
nitrogen dynamics are controlled by niche partitioning—a long pre- 
sumed, but rarely demonstrated mechanism. Baulch, Stanley and 


Bernhardt* have questioned whether my findings have any implica- 
tions for managing water quality in ‘real’ streams, as I suggested. They 
argue that nitrogen assimilation by algae cannot influence long-term 
nitrogen retention due to high turnover of algal biomass and rapid 
recycling of nitrogen, and they suggest that the only permanent loss of 
nitrogen from a stream is via denitrification. 

Baulch, Stanley and Bernhardt are correct in saying that my study 
does not reveal the ultimate, long-term fate of nitrogen in streams. 
Although assimilation represents the critical first step in nitrogen 
removal from water, it must be followed by some other process that 
leads to long-term storage or permanent removal from a stream if 
there is to be any meaningful improvement in water quality. Increased 
rates of assimilation very likely increase the probability that other 
long-term mechanisms operate, but my study only explicitly linked 
diversity to assimilation. So I agree that researchers now need to take 
the next step and examine the impacts of biodiversity on additional 
pathways involved in the nitrogen cycle. 

But as we take these next steps, I worry that Baulch, Stanley and 
Bernhardt have given an incomplete roadmap. From their comment, 
one might get the impression that everyone agrees on what processes 
control the fate of nitrogen in streams, and that denitrification is the 
sole process we need to focus on. I do not believe this to be the case. 

Baulch, Stanley and Bernhardt present what I call the ‘quasi-chemostat’ 
model of a stream, which assumes that streams are in a 
steady state such that there is no net accrual of biomass, and no net 
storage of material. As nutrients flow into a stream, they are assimi- 
lated and remineralized at comparable rates by organisms with high 
turnover. The nutrients are not transformed in any way, such that 
nitrogen simply ‘spirals’ downstream in the same biologically active 
form. They argue that the one exception to these chemostat-like 
assumptions is that denitrification may transfer some nitrogen from 
the stream to air. 
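
The logic of the quasi-chemostat view can be made concrete with a toy mass balance. What follows is a minimal sketch and is not taken from either paper: it assumes a single well-mixed stream reach with one dissolved and one biomass nitrogen pool, and every name and value in it (`flow`, `uptake`, `remin`, `denit`, `burial`) is a hypothetical first-order rate chosen only for illustration.

```python
def steady_state_n(n_in, flow, uptake, remin, denit, burial=0.0,
                   dt=0.01, steps=50_000):
    """Integrate a toy two-pool nitrogen budget for one stream reach.

    All rates are hypothetical first-order constants (per unit time):
      flow   - flushing of dissolved N by through-flow
      uptake - assimilation of dissolved N into biomass
      remin  - remineralization of biomass N back to dissolved N
      denit  - denitrification loss of dissolved N to the air
      burial - any long-term loss from biomass (storage, emergence, export)
    Returns (dissolved_n, biomass_n) after forward-Euler integration.
    """
    dissolved, biomass = 0.0, 0.0
    for _ in range(steps):
        # Both rates of change are computed from the current state
        # before either pool is updated.
        d_dissolved = (flow * (n_in - dissolved)
                       - (uptake + denit) * dissolved
                       + remin * biomass)
        d_biomass = uptake * dissolved - (remin + burial) * biomass
        dissolved += dt * d_dissolved
        biomass += dt * d_biomass
    return dissolved, biomass


# Quasi-chemostat case: nothing leaves the biomass pool except by recycling.
low, _ = steady_state_n(10.0, flow=1.0, uptake=2.0, remin=1.0, denit=0.1)
high, _ = steady_state_n(10.0, flow=1.0, uptake=8.0, remin=1.0, denit=0.1)

# Same rates, but some assimilated N escapes the recycling loop.
low_b, _ = steady_state_n(10.0, 1.0, 2.0, 1.0, 0.1, burial=0.5)
high_b, _ = steady_state_n(10.0, 1.0, 8.0, 1.0, 0.1, burial=0.5)

print(f"no burial:   {low:.3f} vs {high:.3f}")
print(f"with burial: {low_b:.3f} vs {high_b:.3f}")
```

In this sketch the steady-state dissolved concentration is n_in * flow / (flow + denit + uptake * burial / (remin + burial)). When `burial` is zero the uptake rate drops out and only denitrification lowers the exported concentration, which is the quasi-chemostat conclusion. Once any assimilated nitrogen leaves the recycling loop, a higher uptake rate lowers the exported concentration as well.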

Many researchers, including myself, think that the quasi-chemostat 
model makes overly simplistic assumptions that are not supported by 
existing data. Dozens of ¹⁵N tracer studies have shown the following. 
First, assimilation accounts for an average of 84% of NO₃⁻ removal 
from stream water, whereas denitrification accounts for just 16%. 
Second, up to 67% of assimilated nitrogen can go into primary 
consumers (a pathway that Baulch, Stanley and Bernhardt do not 
consider), much of which can be lost to terrestrial environments via 
emergence or consumption by terrestrial predators. Third, some 
studies indicate that 25% or more of imported nitrogen can end up 
in depositional habitats or hyporheic zones, where it has residence 
times on the scale of years. Last, several studies show that the vast 
majority of nitrogen that enters a stream reach is transformed and 
exported downstream in a particulate form that is less available for 
biological uptake. 
These findings all violate the assumptions that Baulch, Stanley and 
Bernhardt have made about nitrogen cycling in streams. So although 
they make a useful point that we need to examine how biodiversity 
influences the suite of processes that influence the long-term fate of 
assimilated nitrogen, I believe that we would benefit from taking a 
broader view of the nitrogen cycle than depicted in figure 1 of Baulch, 
Stanley and Bernhardt. 